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Enterococcus faecalis polynucleotides and polypeptides 

Field of the Invention 

The present invention relates to novel Enterococcus faecalis genes (E. faecalis) 
5 nucleic acids and polypeptides. Also provided are vectors, host cells and recombinant 
methods for producing the same. Further provided are diagnostic methods for 
detecting Enterococcus faecalis using probes, primers, and antibodies to the E. faecalis 
nucleic acids and polypeptides of the present invention. The invention further relates 
to screening methods for identifying agonists and antagonists of E. faecalis 
10 polypeptide activity and to vaccines using E. faecalis nucleic acids and polypeptides. 

Background of the Invention 

Enterococci have been recognized as being pathogenic for humans since the 
turn of the century when they were first described by Thiercelin in 1988 as 

15 microscopic organisms. The genus Enterococcus includes the species Enterococcus 
faecalis or E. faecalis which is the most common pathogen in the group, accounting for 
80 - 90 percent of all enterococcal infections. See Lewis et al. (1990) Eur J. Clin 
Microbiol Infect Dis.9: 111-117. 

The incidence of enterococcal infections has increased in recent years and 

20 enterococci are now the second most frequently reported nosocomial pathogens. 

Enterococcal infection is of particular concern because of its resistance to antibiotics. 
Recent attention has focused on enterococci not only because of their increasing role in 
nosocomial infections, but also because of their remarkable and increasing resistance to 
antimicrobial agents. These factors are mutually reinforcing since resistance allows 

25 enterococci to survive in an environment in which antimicrobial agents are heavily 
used; the hospital setting provides the antibiotics which eliminate or suppress 
susceptible bacteria, thereby providing a selective advantage for resistant organisms, 
and the hospital also provides the potential for dissemination of resistant enterococci 
via the usual routes of hand and environmental contamination. 
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Antimicrobial resistance can be divided into two general types, inherent or 
intrinsic property and that which is acquired. The genes for intrinsic resistance, like 
other species characteristics, appear to reside on the chromosome. Acquired 
resistance results from either a mutation in the existing DNA or acquisition of new 

5 DNA. The various inherent traits expressed by enterococci include resistance to 
semisynthetic penicillinase-resistant penicillins, cephalosporins, low levels of 
aminoglycosides, and low levels of clindamycin. Examples of acquired resistance 
include resistance to chloramphenicol, erythromycin, high levels of clindamycin, ^ 
tetracycline, high levels of aminoglycosides, penicillin by means of penicillinase, 

10 fluoroquinolones, and vancomycin. Resistance to high levels of penicillin without 
penicillinase and resistance to fluoroquinolones are not known to be plasmid or 
transposon mediated and presumably are due to mutation(s). 

Although the main reservoir for enterococci in humans is the gastrointestinal 
tract, the bacteria can also reside in the gallbladder, urethra and vagina. 

1 5 E. faecalis has emerged as an important pathogen in endocarditis, bacteremia, 

urinary tract infections (UTIs), intraabdominal infections, soft tissue infections, and 
neonatal sepsis. See Lewis et al. (1990) supra.. In the 1970s and 1980s enterococci 
became firmly established as major nosocomial pathogens. They are now the fourth 
leading cause of hospital-acquired infection and the third leading cause of bacteremia in 

20 the United States. Fatality ratios for enterococcal bactermia range from 1 2% to 68%, 
with death due to enterococcal sepsis in 4 to 50% of these cases. See T.G. Emori 
(1993) Clin. Microbiol. Rev. 6:428-442. 

The ability of enterococci to colonize the gastrointestinal tract, plus the many 
intrinsic and acquired resistance traits, means that these organisms, which usually 

25 seem to have relatively low intrinsic virulence, are given an excellent opportunity to 
become secondary invaders. Since nosocomial isolates of enterococci have displayed 
resistance to essentially every useful antimicrobial agent, it will likely become 
increasingly difficult to successfully treat and control enterococcal infections. 
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Particularly when the various resistance genes come together in a single strain, an 
event almost certain to occur at some time in the future. 

The etiology of diseases mediated or exacerbated by Enterococcus faecalis, 
involves the programmed expression of E. faecalis genes, and that characterizing these 
5 genes and their patterns of expression would dramatically add to our understanding of 
the organism and its host interactions. Knowledge of the E. faecalis gene and genomic 
organization would improve our understanding of disease etiology and lead to 
improved and new ways of preventing, treating and diagnosing diseases. Thus, there 
is a need to characterize the genome of E. faecalis and for polynucleotides of this 
10 organism. 

Summary of the Invention 

The present invention provides for isolated E. faecalis polynucleotides and 
polypeptides shown in Table 1 and SEQ ID NO: 1 through SEQ ID NO:496 

15 (polynucleotide sequences having odd SEQ ID NOs and polypeptide sequences 

having even SEQ ID NOs). One aspect of the invention provides isolated nucleic acid 
molecules comprising polynucleotides having a nucleotide sequence selected from the 
group consisting of: (a) a nucleotide sequence shown in Table 1; (b) a nucleotide 
sequence encoding any of the amino acid sequences of the polypeptides shown in 

20 Table 1 ; and (c) a nucleotide sequence complementary to any of the nucleotide 

sequences in (a) or (b). The invention further provides for fragments of the nucleic 
acid molecules of (a), (b) & (c) above. 

Further embodiments of the invention include isolated nucleic acid molecules 
that comprise a polynucleotide having a nucleotide sequence at least 90% identical, 

25 and more preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the 
nucleotide sequences in (a), (b) or (c) above, or a polynucleotide which hybridizes 
under stringent hybridization conditions to a polynucleotide in (a), (b) or (c) above. 
Additional nucleic acid embodiments of the invention relate to isolated nucleic acid 
molecules comprising polynucleotides which encode the amino acid sequences of 
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epitope-bearing portions of a E. faecalis polypeptide having an amino acid sequence in 
(a) above. 

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing 
5 the recombinant vectors, as well as to methods of making such vectors and host cells. 
The present invention further relates to the use of these vectors in the production of 
E. faecalis polypeptides or peptides by recombinant techniques. 

The invention further provides isolated E. faecalis polypeptides having an 
amino acid sequence selected from the group consisting of an amino acid sequence of 
10 any of the polypeptides described in Table 1 or fragments thereof. 

The polypeptides of the present invention also include polypeptides having 
an amino acid sequence with at least 70% similarity, and more preferably at least 75%, 
80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in Table 
1, as well as polypeptides having an amino acid sequence at least 70% identical, more 
15 preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 95%, 
96%, 97%, 98%, or 99% identical to those above; as well as isolated nucleic acid 
molecules encoding such polypeptides. 

The present invention further provides a single or multi-component vaccine 
comprising one or more of the 2?. faecalis polynucleotides or polypeptides described 
20 in Table 1 , or fragments thereof, together with a pharmaceutically acceptable diluent, 
carrier, or excipient, wherein the E, faecalis polypeptide(s) are present in an amount 
effective to elicit an immune response to members of the Enterococcus genus, or at 
least E. faecalis , in an animal. The E. faecalis polypeptides of the present invention 
may further be combined with one or more immunogens of one or more other 
25 Enterococcal or nori-Enterococcal organisms to produce a multi-component vaccine 
intended to elicit an immunological response against members of the Enterococcus 
genus and, optionally, one or more non-Enterococcal organisms. 

The vaccines of the present invention can be administered in a DNA form, e.g., 
"naked" DNA, wherein the DNA encodes one or more Enterococcal polypeptides 
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and, optionally, one or more polypeptides of a non-Enterococcal organism. The DNA 
encoding one or more polypeptides may be constructed such that these polypeptides 
are expressed as fusion proteins. 

The vaccines of the present invention may also be administered as a 
5 component of a genetically engineered organism or host cell. Thus, a genetically 

engineered organism or host cell which expresses one or more E. faecalis polypeptides 
may be administered to an animal. For example, such a genetically engineered 
organism or host cell may contain one or more E.faecalis polypeptides of the present 
invention intracellularly, on its cell surface, or in its periplasmic space. Further, such 
1 0 a genetically engineered organism or host cell may secrete one or more E. faecalis 
polypeptides. The vaccines of the present invention may also be co-administered to 
an animal with an immune system modulator (e.g., CD86 and GM-CSF). 

The invention also provides a method of inducing an immunological response 
in an animal to one or more members of the Enterococcus genus, preferably one or 
15 more isolates of the E.faecalis species, comprising administering to the animal a 
vaccine as described above. 

The invention further provides a method of inducing a protective immune 
response in an animal, sufficient to prevent, attenuate, or control an infection by 
members of the Enterococcus genus, preferably at least E. faecalis species, 
20 comprising administering to the animal a composition comprising one or more of the 
polynucleotides or polypeptides described in Table 1, or fragments thereof. Further, 
these polypeptides, or fragments thereof, may be conjugated to another immunogen 
and/or administered in admixture with an adjuvant. 

The invention further relates to antibodies elicited in an animal by the 
25 administration of one or more E. faecalis polypeptides of the present invention and to 
methods for producing such antibodies and fragments thereof. The invention further 
relates to recombinant antibodies and fragments thereof and to methods for producing 
such antibodies and fragments thereof. 

The invention also provides diagnostic methods for detecting the expression of 
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the polynucleotides of Table 1 by members of the Enterococcus genus in an animal. 
One such method involves assaying for the expression of a polynucleotide encoding 
E.faecalis polypeptides in a sample from an animal. This expression may be assayed 
either directly (e.g., by assaying polypeptide levels using antibodies elicited in 
5 response to amino acid sequences described in Table 1) or indirectly (e.g., by assaying 
for antibodies having specificity for amino acid sequences described in Table 1). The 
expression of polynucleotides can also be assayed by detecting the nucleic acids of 
Table 1 . An example of such a method involves the use of the polymerase chain 
reaction (PCR) to amplify and detect Enterococcus nucleic acid sequences. 

10 The present invention also relates to nucleic acid probes having all or part of a 

nucleotide sequence described in Table 1 (odd SEQ ID NOs) which are capable of 
hybridizing under stringent conditions to Enterococcus nucleic acids. The invention 
further relates to a method of detecting one or more Enterococcus nucleic acids in a 
biological sample obtained from an animal, said one or more nucleic acids encoding 

15 Enterococcus polypeptides, comprising: (a) contacting the sample with one or more 
of the above-described nucleic acid probes, under conditions such that hybridization 
occurs, and (b) detecting hybridization of said one or more probes to the Enterococcus 
nucleic acid present in the biological sample. 

Other uses of the polypeptides of the present invention include: inter alia, to 

20 detect E. frftraHs in immunoassays, as epitope tags, as molecular weight markers on 
SDS-PAGE gels, as molecular weight markers for molecular sieve gel filtration 
columns, to generate antibodies that specificaly bind E.faecalis polypeotides of the 
present invention for the detection E.faecalis in immunoassays, to generate an 
immune response against E. faecalis and other Enterococcus species, and as vaccines 

25 against E. faecalis, other Enterococcus species and other bacteria genuses. 

Isolated nucleic acid molecules of the present invention, particularly DNA 
molecules, are useful as probes for gene mapping and for identifying E.faecalis in a 
biological samples, for instance, by Southern and Northern blot analysis. 
Polynucleotides of the present invention are also useful in detecting E. faecalis by 
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PCR using primers for a particular E. faecalis polynucleotide. Isolated 
polynucleotides of the present invention are also useful in making the polypeptides of 
the present invention. 

5 Detailed Description 

The present invention relates to recombinant E. faecalis nucleic acids and 
fragments thereof. The present invention further relates to recombinant E. faecalis 
polypeptides and fragments thereof. The invention also relates to methods for using 
these polypeptides to produce immunological responses and to confer immunological 

10 protection to disease caused by members of the genus Enter vcoccus, at least isolates 
of the E. faecalis genus. The invention further relates to nucleic acid sequences which 
encode antigenic E. faecalis polypeptides and to methods for detecting E. faecalis 
nucleic acids and polypeptides in biological samples. The invention also relates to 
antibodies specific for the polypeptides and peptides of the present invention and 

15 methods for detecting such antibodies produced in a host animal. 

Definitions 

The following definitions are provided to clarify the subject matter which the 
inventors consider to be the present invention. 
20 As used herein, the phrase "pathogenic agent" means an agent which causes a 

disease state or affliction in an animal. Included within this definition, for examples, 
are bacteria, protozoans, fungi, viruses and metazoan parasites which either produce a 
disease state or render an animal infected with such an organism susceptible to a 
disease state a secondary infection). Further included are species and strains of 
25 the genus Enterococcus which produce disease states in animals. 

As used herein, the term "organism" means any living biological system, 
including viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term "Enterococcus" means any species or strain of 
bacteria which is members of the genus Enterococcus. Such species and strains are 
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known to those of skill in the art, and include those that are pathogenic and those that 
are not. 

As used herein, the phrase "one or more E.faecalis polypeptides of the 
present invention 11 means polypeptides comprising the amino acid sequence of one or 

5 more of the E.faecalis polypeptides described in Table 1 (even SEQ ID NOs). These 
polypeptides may be expressed as fusion proteins wherein the E. faecalis 
polypeptides of the present invention are linked to additional amino acid sequences 
which may be of Enterococcal or non-Enterococcal origin. This phrase further 
includes polypeptide comprising fragments of the E.faecalis polypeptides of the 

K) present invention. Additional definitions are provided throughout the specification. 

Explanation of Table 1 

Table 1 , below, provides information describing genes which encode 
polypeptides of E.faecalis. The table lists the gene identifier which consists of the 

15 letters EF, which denote E.faecalis, followed immediately by a three digit numeric 
code, which arbitrarily number the E.faecalis genes of the present invention. A 
number from 1 through 4 follows the three digit number. A number 1 represents the 
full length open reading frame of the gene specified by the preceeding three digit 
number. A number 2 represents the full leng+fc polypeptide encoded by the gene 

20 specified the preceeding three digit number. A number 3 represents a polynucleotide 
fragment, of the gene represented by the preceeding three digit number, used to 
produce an antigenic polypeptide. A number 4 represents an antigenic polypeptide 
fragment, of the gene represented by the preceeding three digit number, used to 
stimulate an immune response or as a vaccine. The nucleotide and amino acid 

25 sequences of each gene and fragment are also shown in the Sequence Listing under the 
SEQ ID NO listed in Table 1. 

Explanation of Table 2 

Table 2 lists accession numbers for the closest matching sequences between 
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the polypeptides of the present invention and those available through GenBank and 
Derwent databases. These reference numbers are the database entry numbers 
commonly used by those of skill in the art, who will be familar with their 
denominations. The descriptions of the numenclature for GenBank are available from 
5 the National Center for Biotechnology Information. Column 1 lists the gene or ORF 
of the present invention. Column 2 lists the accession number of a "match" gene 
sequence in GenBank or Derwent databases. Column 3 lists the description of the 
"match" gene sequence. Columns 4 and 5 are the high score and smallest sum 
probability, respectively, calculated by BLAST. Polypeptides of the present 
10 invention that do not share significant identity/similarity with any polypeptide 

sequences of GenBank and Derwent are not represented in Table 2. Polypeptides of 
the present invention that share significant identity/similarity with more than one of 
the polypeptides of GenBank and Derwent are represented more than once. 

1 5 Explanation of Table 3. 

The E. faecalis polypeptides of the present invention may include one or more 
conservative amino acid substitutions from natural mutations or human manipulation 
as indicated in Table 3. Changes are preferably of a minor nature, such as conservative 
amino acid substitutions that do not significantly affect the folding or activity of the 

20 protein. Residues from the following groups, as indicated in Table 3, may be 

substituted for one another: Aromatic, Hydrophobic, Polar, Basic, Acidic, and Small, 

Explanation of Table 4 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope- 
25 bearing fragments present in each of the full length E. faecalis polypeptides described 
in Table 1 as predicted by the inventors using the algorithm of Jameson and Wolf, 
(1988) Comp. Appl. Biosci. 4:181-186. The Jameson-Wolf antigenic analysis was 
performed using the computer program PROTEAN (Version 3. 1 1 for the Power 
Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, Wl). E.faecalis 
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polypeptide shown in Table 1 may one or more antigenic epitopes comprising 
residues described in Table 4. It will be appreciated that depending on the analytical 
criteria used to predict antigenic determinants, the exact address of the determinant 
may vary slightly. The residues and locations shown described in Table 4 correspond 
5 to the amino acid sequences for each full length gene sequence shown in Table 1 and in 
the Sequence Listing. Polypeptides of the present invention that do not have 
antigenic epitopes recognized by the Jameson-Wolf algorithm are not represented in 
Table 2. 

1 0 Selection of Nucleic Acid Sequences Encoding Antigenic E. faecalis Polypeptides 

Sequenced E. faecalis genomic DNA was obtained from the E. faecalis strain 
V586. The E. faecalis strain V586 was deposited 2 May 1997 at the ATCC, 10801 
University Blvd. Manassas, VA 201 10-2209, and given accession number 55969. 
Some ORFs contained in the subset of fragments of the E. faecalis genome 

15 disclosed herein were derived through the use of a number of screening criteria detailed 
below. The ORFs are bounded at the amino terminus by a methionine or valine 
residue and usually at the carboxy terminus by a stop codon. 

Most of the selected sequences consist of complete ORFs. The polypeptides 
that do not comprise a complete ORF can be determined by determining whether the 

20 corresponding polynucleotide sequence comprises a stop codon after the codon for 
the last amino acid residue in the polypeptide sequence. It is not always preferred to 
express a complete ORF in a heterologous system. It may be challenging to express 
and purify a highly hydrophobic protein by common laboratory methods. Some of 
the polypeptide vaccine candidates described herein have been modified slightly to 

25 simplify the production of recombinant protein. For example, nucleotide sequences 
which encode highly hydrophobic domains, such as those found at the amino terminal 
signal sequence, have been excluded from some constructs used for expression of the 
polypeptides. Furthermore, any highly hydrophobic amino acid sequences occurring 
at the caiboxy terminus have also been excluded from the recombinant expression 
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constructs. Thus, in one embodiment, a polypeptide which represents a truncated or 
modified ORF may be used as an antigen. 

While numerous methods are known in the art for selecting potentially 
immunogenic polypeptides, many of the ORFs disclosed herein were selected on the 
5 basis of screening Enterococcus faecalis ORFs for several aspects of potential 
immunogenicity. One set of selection criteria are as follows: 

1 . Type I signal sequence: An amino terminal type I signal sequence generally 
directs a nascent protein across the plasma and outer membranes to the exterior of the 
bacterial cell. Experimental evidence obtained from studies with Escherichia coli 

10 suggests that the typical type I signal sequence consists of the following biochemical 
and physical attributes (Izard, J. W. and Kendall, D. A. Mol Microbiol 75:765-773 
(1994)). The length of the type I signal sequence is approximately 15 to 25 primarily 
hydrophobic amino acid residues with a net positive charge in the extreme amino 
terminus. In addition, the central region of the signal sequence adopts an alpha-helical 

15 conformation in a hydrophobic environment. Finally, the region surrounding the 

actual site of cleavage is ideally six residues long, with small side-chain amino acids in 
the -1 and -3 positions. 

2. Type IV signal sequence: The type IV signal sequence is an example of the 
several types of functional signal sequences which exist in addition to the type I signal 

20 sequence detailed above. Although functionally related, the type IV signal sequence 
possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, 
S., J. Bacteriol 774:7345-7351 (1992)). These are typically six to eight amino acids 
with a net basic charge followed by an additional sixteen to thirty primarily 
hydrophobic residues. The cleavage site of a type IV signal sequence is typically after 

25 the initial six to eight amino acids at the extreme amino terminus. In addition, type IV 
signal sequences generally contain a phenylalanine residue at the +1 site relative to the 
cleavage site. 

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial 
lipoprotein precursors has allowed the definition of a consensus amino acid sequence 
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for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors 
examined contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to 
the point of cleavage (Hayashi, S. and Wu, H. C, J. Bioenerg. Biomembr. 22:451-47 1 
(1990)). 

5 4. LPXTG motif: It has been experimentally determined that most anchored 

proteins found on the surface of gram-positive bacteria possess a highly conserved 
carboxy terminal sequence. More than fifty such proteins from organisms such as S. 
pyogenes, S. mutans, E. faecalis, S. pneumoniae, and others, have been identified based 
on their extracellular location and carboxy terminal amino acid sequence (Fischetti, V. 

10 A., ASM News 62:405-410 (1996)). The conserved region consists of six charged 
amino acids at the extreme carboxy terminus coupled to 15-20 hydrophobic amino 
acids presumed to function as a transmembrane domain. Immediately adjacent to the 
transmembrane domain is a six amino acid sequence conserved in nearly all proteins 
examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any 

15 amino acid. 

An algorithm for selecting antigenic and immunogenic Enterococcus faecalis 
polypeptides including the foregoing criteria was developed. The algorithm is similar 
to that described in U.S. patent application 08/781 ,986, filed January 3, 1997, which 
is fully incorporated by reference herein. Use of the algorithm by the inventors to 
20 select immunologically useful Enterococcus faecalis polypeptides resulted in the 
selection of a number of the disclosed ORFs. Polypeptides comprising the 
polypeptides identified in this group may be produced by techniques standard in the 
art and as further described herein. 

25 Nucleic Acid Molecules 

Sequenced E. faecalis genomic DN A was obtained from the E. faecalis strain V586. As 
discussed elsewhere hererin, polynucleotides of the present invention readily may be 
obtained by routine application of well known and standard procedures for cloning 
and sequencing DNA. Detailed methods for obtaining libraries and for sequencing are 
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provided below, for instance. A wide variety of Enterococcus faecalis strains that can 
be used to prepare E. faecalis genomic DNA for cloning and for obtaining 
polynucleotides and polypeptides of the present invention. A wide variety of 
Enterococcus faecalis strains are available to the public from recognized depository 

5 institutions, such as the American Type Culture Collection (ATCC). It is recognized 
that minor variation is the nucleic acid and amino acid sequence may be expected from 
E faecalis strain to strain. The present invention provides for genes, including both 
polynucleotides and polypeptides, of the of the present invention from all the 
Enterococcus faecalis strains. 

10 Unless otherwise indicated, all nucleotide sequences determined by sequencing 

a DNA molecule herein were determined using an automated DNA sequencer (such as 
the Model 373 from Applied Biosystems, Inc., Foster City, CA), and all amino acid 
sequences of polypeptides encoded by DNA molecules determined herein were 
predicted by translation of a DNA sequence determined as above. Therefore, as is 

15 known in the art for any DNA sequence determined by this automated approach, any 
nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, more 
typically at least about 95% to at least about 99.9% identical to the actual nucleotide 
sequence of the sequenced DNA molecule. The actual sequence can be more 

20 precisely determined by other approaches including manual DNA sequencing methods 
well known in the art. As is also known in the art, a single insertion or deletion in a 
determined nucleotide sequence compared to the actual sequence will cause a frame 
shift in translation of the nucleotide sequence such that the predicted amino acid 
sequence encoded by a determined nucleotide sequence will be completely different 

25 from the amino acid sequence actually encoded by the sequenced DNA molecule, 
beginning at the point of such an insertion or deletion. In case of conflict between 
Table 1 and either the nucleic acid sequence of the clones listed in Table 1 or the amino 
acid sequence of the protein expressed by the clones listed in Table 1, the clones listed 
in Table 1 are controlling. By "nucleotide sequence" of a nucleic acid molecule or 



WO 98/50554 



-14- 



PCT/US98/08959 



polynucleotide is intended to mean either a DNA or RNA sequence.Using the 
information provided herein, such as the nucleotide sequence in Table 1 , a nucleic acid 
molecule of the present invention encoding a Kfaecalis polypeptide may be obtained 
using standard cloning and screening procedures, such as those for cloning DNAs 

5 using genomic DNA as starting material. See, e.g., Sambrook et al. MOLECULAR 
CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nded. 
1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY 
(John Wiley and Sons, N.Y. 1989). Illustrative of the invention, the nucleic acid 
molecule described in Table 1 was discovered in a DNA library derived from aJ?. 

1 0 faecalis genomic DNA. 

Nucleic acid molecules of the present invention may be in the form of RNA, 
such as mRNA, or in the form of DNA, including, for instance, DNA and genomic 
DNA obtained by cloning or produced synthetically. The DNA may be 
double-stranded or single-stranded. Single-stranded DNA or RNA may be the coding 

15 strand, also known as the sense strand, or it may be the non-coding strand, also 
referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. This includes 
segments of DNA comprising the E. faecalis polynucleotides of the present invention 

20 isolated from the native chromosome. These fragments include both isolated 

fragments consisting only of E. faecalis DNA and fragments comprising heterologous 
sequences such as vector sequences or other foreign DNA. For example, recombinant 
DNA molecules contained in a vector are considered isolated for the purposes of the 
present invention. Further examples of isolated DNA molecules include recombinant 

25 DNA molecules maintained in heterologous host cells or purified (partially or 

substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or 
in vitro RNA transcripts of the DNA molecules of the present invention. Isolated 
nucleic acid molecules according to the present invention further include such 
molecules produced synthetically. 
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In addition, isolated nucleic acid molecules of the invention include DNA 
molecules which comprise a sequence substantially different from those described 
above but which, due to the degeneracy of the genetic code, still encode a E. faecalis 
polypeptides and peptides of the present invention (e.g. polypeptides of Table 1). 

5 That is, all possible DNA sequences that encode the E. faecalis polypeptides of the 
present invention. This includes the genetic code and species-specific codon 
preferences known in the art. Thus, it would be routine for one skilled in the art to 
generate the degenerate variants described above, for instance, to optimize codon 
expression for a particular host (e.g., change codons in the bacteria mRNA to those 

10 preferred by a mammalian or other bacterial host such as E. coli). 

The invention further provides isolated nucleic acid molecules having the 
nucleotide sequence shown in Table 1 or a nucleic acid molecule having a sequence 
complementary to one of the above sequences. Such isolated molecules, particularly 
DNA molecules, are useful as probes for gene mapping and for identifying E. faecalis 

15 in a biological sample, for instance, by PCR, Southern blot, Northern blot, or other 
form of hybridization analysis. 

The present invention is further directed to nucleic acid molecules encoding 
portions or fragments of the nucleotide sequences described herein. Fragments include 
portions of the nucleotide sequences of Table 1, or the E. faecalis nucleotide 

20 sequences contained in the plasimd clones listed in Table 1 , at least 10 contiguous 
nucleotides in length selected from any two integers, one of which representing a 5' 
nucleotide position and a second of which representing a 3 1 nucleotide position, where 
the first nucleotide for each nucleotide sequence in Table 1 is position 1 . That is, 
every combination of a 5 1 and 3' nucleotide position that a fragment at least 10 

25 contiguous nucleotides in length could occupy is included in the invention. At least 
means a fragment may be 10 contiguous nucleotide bases in length or any integer 
between 10 and the length of an entire nucleotide sequence of Table 1 minus 1. 
Therefore, included in the invention are contiguous fragments specified by any 5' and 
3' nucleotide base positions of a nucleotide sequences of Table 1 wherein the 
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contiguous fragment is any integer between 10 and the length of an entire nucleotide 
sequence minus 1 . 

Further, the invention includes polynucleotides comprising fragments specified 
by size, in nucleotides, rather than by nucleotide positions. The invention includes 
5 any fragment size, in contiguous nucleotides, selected from integers between 10 and 
the length of an entire nucleotide sequence minus 1 . Preferred sizes of contiguous 
nucleotide fragments include 20 nucleotides, 30 nucleotides, 40 nucleotides, 50 
nucleotides. Other preferred sizes of contiguous nucleotide fragments, which may be 
useful as diagnostic probes and primers, include fragments 50-300 nucleotides in 

10 length which include, as discussed above, fragment sizes representing each integer 

between 50-300. Larger fragments are also useful according to the present invention . 
corresponding to most, if not all, of the nucleotide sequences shown in Table lor of 
the E.faecalis nucleotide sequences of the plasimd clones listed in Table 1. The 
preferred sizes are, of course, meant to exemplify not limit the present invention as all 

15 size fragments, representing any integer between 1 0 and the length of an entire 
nucleotide sequence minus 1, are included in the invention. Additional preferred 
nucleic acid fragments of the present invention include nucleic acid molecules encoding 
epitope-bearing portions of E. faecalis polypeptides identified in Table 4. 

The present invention also provides for the exclusion of any fragment, 

20 specified by 5' and 3 1 base positions or by size in nucleotide bases as described above 
for any nucleotide sequence of Table 1 or the plasimd clones listed in Table 1 . Any 
number of fragments of nucleotide sequences in Table 1 or the plasimd clones listed in 
Table 1, specified by 5' and 3 1 base positions or by size in nucleotides, as described 
above, may be excluded from the present invention. 

25 In another aspect, the invention provides an isolated nucleic acid molecule 

comprising a polynucleotide which hybridizes under stringent hybridization 
conditions to a portion of a polynucleotide in a nucleic acid molecules of the invention 
described above, for instance, nucleotide sequences of Table 1 or the E.faecalis 
sequences of the plasimd clones listed in Table 1. By "stringent hybridization 
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conditions" is intended overnight incubation at 42°C in a solution comprising: 50% 
formamide, 5x SSC (150 mM NaCl, 1 5 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5x Denhardfs solution, 10% dextran sulfate, and 20 (ig/ml 
denatured, sheared salmon sperm DNA, followed by washing the filters in O.lx SSC at 
5 about 65°C. 

By a polynucleotide which hybridizes to a "portion" of a polynucleotide is 
intended a polynucleotide (either DNA or RNA) hybridizing to at least about 15 
nucleotides bases, and more preferably at least about 20 nucleotides bases, still more 
preferably at least about 30 nucleotides bases, and even more preferably about 30-70 

10 (e.g., 50) nucleotides bases of the reference polynucleotide. These are useful as 

diagnostic probes and primers as discussed above. By a portion of a polynucleotide 
of "at least 20 nucleotides bases in length," for example, is intended 20 or more 
contiguous nucleotides bases nucleotides from the nucleotide sequence of the reference 
polynucleotide (e.g., the nucleotide sequence as shown in Table 1). Portions of a 

1 5 polynucleotide which hybridizes to a nucleotide sequence in Table 1 , which can be 
used as probes and primers, may also be precisely specified by 5' and 3* base 
positions or by size in nucleotide bases as described above or precisely excluded in the 
same manner. 

The nucleic acid molecules of the present invention include those encoding the 
20 full length E. faecalis polypeptides of Table 1 and portions of the E. faecalis 
polypeptides of Table 1. Also included in the present invention are nucleic acids 
encoding the above full length sequences and further comprise additional sequences, 
such as those encoding an added secretory leader sequence, such as a pre-, or pro- or 
prepro- protein sequence. Further included in the present invention are nucleic acids 
25 encoding the above full length sequences and portions thereof and further comprise 
additional heterologous amino acid sequences encoded by nucleic acid sequences from 
a different source. 

Also included in the present invention are nucleic acids encoding the above 
protein sequences together with additional, non-coding sequences, including for 
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example, but not limited to non-coding 5' and 3* sequences. These sequences include 
transcribed, non-translated sequences that may play a role in transcription, and 
mRNA processing, for example, ribosome binding and stability of mRNA. Also 
included in the present invention are additional coding sequences which provide 
5 additional functionalities. 

Thus, a nucleotide sequence encoding a polypeptide may be fused to a marker 
sequence, such as a sequence encoding a peptide which facilitates purification of the 
fused polypeptide. In certain preferred embodiments of this aspect of the invention, 
the marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in 

1 0 a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 9 1 3 1 1 ), among 
others, many of which are commercially available. For instance, hexa-histidine 
provides for convenient purification of the fusion protein. See Gentz et al. (1989) 
Proc. Natl. Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for 
purification which corresponds to an epitope derived from. the influenza hemagglutinin 

15 protein. See Wilson et al. (1984) Cell 37:767. As discussed below, other such fusion 
proteins include the E. faecalis polypeptides of the present invention fused to Fc at 
the N- or C-tenninus. 

Variant and Mutant Polynucleotides 
20 The present invention further relates to variants of the nucleic acid molecules 

which encode portions, analogs or derivatives of a is. faecalis polypeptides of Table 1 

and variant polypeptides thereof including portions, analogs, and derivatives of the E. 

faecalis polypeptides. Variants may occur naturally, such as a natural allelic variant. 

By an "allelic variant" is intended one of several alternate forms of a gene occupying a 
25 given locus on a chromosome of an organism. See, e.g., B. Lewin, Genes IV (1990). 

Non-naturally occurring variants may be produced using art-known mutagenesis 

techniques. 

Such nucleic acid variants include those produced by nucleotide substitutions, 
deletions, or additions. The substitutions, deletions, or additions may involve one or 
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more nucleotides. The variants may be altered in coding regions, non-coding regions, 
or both. Alterations in the coding regions may produce conservative or 
non-conservative amino acid substitutions, deletions or additions. Especially 
preferred among these are silent substitutions, additions and deletions, which do not 
5 alter the properties and activities of a E. faecalis protein of the present invention or 
portions thereof. Also especially preferred in this regard are conservative 
substitutions. 

Such polypeptide variants include those produced by amino acid 
substitutions, deletions or additions. The substitutions, deletions, or additions may 

10 involve one or more residues. Alterations may produce conservative or 

non-conservative amino acid substitutions, deletions, or additions. Especially 
preferred among these are silent substitutions, additions and deletions, which do not 
alter the properties and activities of a E. faecalis protein of the present invention or 
portions thereof. Also especially preferred in this regard are conservative 

15 substitutions. 

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing 
the recombinant vectors, as well as to methods of making such vectors and host cells 
and for using them for production of E. faecalis polypeptides or peptides by 

20 recombinant techniques. 

The present application is directed to nucleic acid molecules at least 90%, 
95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1 . 
The above nucleic acid sequences are included irrespective of whether they encode a 
polypeptide having E. faecalis activity. This is because even where a particular 

25 nucleic acid molecule does not encode a polypeptide having E. faecalis activity, one of 
skill in the art would still know how to use the nucleic acid molecule, for instance, as a 
hybridization probe. Uses of the nucleic acid molecules of the present invention that 
do not encode a polypeptide having E. faecalis activity include, inter alia, isolating an 
E. faecalis gene or allelic variants thereof from a DNA library, and detecting E, faecalis 
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mRNA expression samples, environmental samples, suspected of containing E. 
faecalis by Northern Blot analysis. 

Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 
97%, 98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, 
5 in fact, encode a polypeptide having E. faecalis protein activity By "a polypeptide 
having E. faecalis activity" is intended polypeptides exhibiting activity similar, but 
not necessarily identical, to an activity of the E. faecalis protein of the invention, as 
measured in a particular biological assay suitable for measuring activity of the 
specified protein. 

10 Due to the degeneracy of the genetic code, one of ordinary skill in the art will 

immediately recognize that a large number of the nucleic acid molecules having a 
sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid 
sequences shown in Table 1 will encode a polypeptide having E. faecalis protein 
activity. In fact, since degenerate variants of these nucleotide sequences all encode the 

15 same polypeptide, this will be clear to the skilled artisan even without performing the 
above described comparison assay. It will be further recognized in the art that, for 
such nucleic acid molecules that are not degenerate variants, a reasonable number will 
also encode a polypeptide having E. faecalis protein activity. This is because the 
skilled artisan is fully aware of amino acid substitutions that are either less likely or 

20 not likely to significantly effect protein function (e.g., replacing one aliphatic amino 
acid with a second aliphatic amino acid), as further described below. 

The biological activity or function of the polypeptides of the present 
invention are expected to be similar or identical to polypeptides from other bacteria 
that share a high degree of structural identity/similarity. Tables 2 lists accession 

25 numbers and descriptions for the closest matching sequences of polypeptides 

available through Genbank and Derwent databases. It is therefore expected that the 
biological activity or function of the polypeptides of the present invention will be 
similar or identical to those polypeptides from other bacterial genuses, species, or 
strains listed in Table 2. 
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By a polynucleotide having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended 
that the nucleotide sequence of the polynucleotide is identical to the reference 
sequence except that the polynucleotide sequence may include up to five point 
5 mutations per each 1 00 nucleotides of the reference nucleotide sequence encoding the 
E. faecalis polypeptide. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 
5% of the nucleotides in the reference sequence may be deleted, inserted, or 
substituted with another nucleotide. The query sequence may be an entire sequence 
10 shown in Table 1 , the ORF (open reading frame), or any fragment specified as 
described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 

1 5 computer programs. A preferred method for determining the best overall match 
between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using the 
FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et 
al. (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and 

20 subject sequences are both DNA sequences. An RNA sequence can be compared by 
first converting LPs to T's. The result of said global sequence alignment is in percent 
identity. Preferred parameters used in a FASTDB alignment of DNA sequences to 
calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, 
Joining Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap 

25 Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject 
nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3' 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3* 
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truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent 
identity is corrected by calculating the number of bases of the query sequence that are 
5' and 3' of the subject sequence, which are not matched/aligned, as a percent of the 
5 total bases of the query sequence. Whether a nucleotide is matched/aligned is 

determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected 
score is what is used for the purposes of the present invention. Only nucleotides 

10 outside the 5' and 3' nucleotides of the subject sequence, as displayed by the 
FASTDB alignment, which are not matched/aligned with the query sequence, are 
calculated for the purposes of manually adjusting the percent identity score. 

For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide 
query sequence to determine percent identity. The deletions occur at the 5' end of the 

15 subject sequence and therefore, the FASTDB alignment does not show a 

matched/alignment of the first 10 nucleotides at 5' end. The 10 unpaired nucleotides 
represent 10% of the sequence (number of nucleotides at the 5' and 3' ends not 
matched/total number of nucleotides in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 90 

20 nucleotides were perfectly matched the final percent identity would be 90%. In 

another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide 
query sequence. This time the deletions are internal deletions so that there are no 
nucleotides on the 5' or 3* of the subject sequence which are not matched/aligned with 
the query. In this case the percent identity calculated by FASTDB is not manually 

25 corrected. Once again, only nucleotides 5' and 3' of the subject sequence which are 
not matched/aligned with the query sequence are manually corrected for. No other 
manual corrections are to made for the purposes of the present invention. 

Vectors and Host Cell 
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The present invention also relates to vectors which include the isolated DNA 
molecules of the present invention, host cells comprising the recombinant vectors, and 
the production of E, faecalis polypeptides and peptides of the present invention 
expressed by the host cells. 
5 Recombinant constructs may be introduced into host cells using well known 

techniques such as infection, transduction, transfection, transvection, electroporation 
and transformation. The vector may be, for example, a phage, plasmid, viral or 
retroviral vector. Retroviral vectors may be replication competent or replication 
defective. In the latter case, viral propagation generally will occur only in 

10 complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 

15 and then transduced into host cells. 

Preferred are vectors comprising c/s-acting control regions to the 
polynucleotide of interest. Appropriate trans-acting factors may be supplied by the 
host, supplied by a complementing vector or supplied by the vector itself upon 
introduction into the host. 

20 In certain preferred embodiments in this regard, the vectors provide for 

specific expression, which may be inducible and/or cell type-specific. Particularly 
preferred among such vectors are those inducible by environmental factors that are 
easy to manipulate, such as temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-, 

25 episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids, 
bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as 
baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, 
such as cosmids and phagemids. 
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The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
5 constructs will further contain sites for transcription initiation, termination and, in the 
transcribed region, a ribosome binding site for translation. The coding portion of the 
mature transcripts expressed by the constructs will preferably include a translation 
initiating site at the beginning and a termination codon (UAA, UGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

10 As indicated, the expression vectors will preferably include at least one 

selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin, or ampicillin 
resistance genes for culturing in E. coli and other bacteria. Representative examples of 
appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 

15 Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE9, 

20 pQElO available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, 
pNH8A, pNH16a, pNH18A, pNH46A available from Stratagene; pET series of 
vectors available from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540, 
pRIT5 available from Pharmacia. Among preferred eukaryotic vectors are pWLNEO, 
pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and pSVK3, pBPV, 

25 pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily 
apparent to the skilled artisan. 

Among known bacterial promoters suitable for use in the present invention 
include the E. coli lac\ and lacZ promoters, the T3, T5 and T7 promoters, the gpt 
promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic 
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promoters include the CMV immediate early promoter, the HSV thymidine kinase 
promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such 
as those of the Rous sarcoma virus (RSV), and metallothionein promoters, such as the 
mouse metallothionein-I promoter. 

5 Introduction of the construct into the host cell can be effected by calcium 

phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection or other methods. Such methods 
are described in many standard laboratory manuals (for example, Davis, ei al, Basic 
Methods In Molecular Biology ( 1 986)) . 

10 Transcription of DNA encoding the polypeptides of the present invention by 

higher eukaryotes may be increased by inserting an enhancer sequence into the vector. 
Enhancers are exacting elements of DNA, usually about from 10 to 300 nucleotides 
that act to increase transcriptional activity of a promoter in a given host cell-type. 
Examples of enhancers include the SV40 enhancer, which is located on the late side of 

15 the replication origin at nucleotides 100 to 270, the cytomegalovirus early promoter 
enhancer, the polyoma enhancer on the late side of the replication origin, and 
adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the extracellular environment, 

20 appropriate secretion signals may be incorporated into the expressed polypeptide, for 
example, the amino acid sequence KDEL. The signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional heterologous 

25 functional regions. For instance, a region of additional amino acids, particularly 

charged amino acids, may be added to the N-terminus of the polypeptide to improve 
stability and persistence in the host cell, during purification, or during subsequent 
handling and storage. Also, peptide moieties may be added to the polypeptide to 
facilitate purification. Such regions may be removed prior to final preparation of the 
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polypeptide. The addition of peptide moieties to polypeptides to engender secretion 
or excretion, to improve stability and to facilitate purification, among others, are 
familiar and routine techniques in the art. A preferred fusion protein comprises a 
heterologous region from immunoglobulin that is useful to solubilize proteins. For 
5 example, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion proteins 
comprising various portions of constant region of immunoglobulin molecules together 
with another human protein or part thereof. In many cases, the Fc part in a fusion 
protein is thoroughly advantageous for use in therapy and diagnosis and thus results, 
for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 

10 hand, for some uses it would be desirable to be able to delete the Fc part after the 
fusion protein has been expressed, detected and purified in the advantageous manner 
described. This is the case when Fc portion proves to be a hindrance to use in 
therapy and diagnosis, for example when the fusion protein is to be used as antigen for 
immunizations. In drug discovery, for example, human proteins, such as, 

15 hIL5 -receptor has been fused with Fc portions for the purpose of high-throughput 
screening assays to identify antagonists of hIL-5. See Bennett, D. et al. (1995) J. 
Molec. Recogn. 8:52-58 and Johanson, K. et al. (1995) J. Biol. Chem. 270 
(16):9459-9471. 

The E. faecalis polypeptides can be recovered and purified from recombinant 
20 cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography, lectin chromatography and high 
performance liquid chromatography ("HPLC") is employed for purification. 
25 Polypeptides of the present invention include naturally purified products, products of 
chemical synthetic procedures, and products produced by recombinant techniques 
from a prokaryotic or eukaiyotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. 
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Polypeptides and Fragments 

The invention further provides an isolated E. faecalis polypeptide having an 
amino acid sequence in Table 1, or a peptide or polypeptide comprising a portion of 
the above polypeptides. 

5 

Variant and Mutant Polypeptides 

To improve or alter the characteristics of E. faecalis polypeptides of the 
present invention, protein engineering may be employed. Recombinant DNA 
technology known to those skilled in the art can be used to create novel mutant 
10 proteins or muteins including single or multiple amino acid substitutions, deletions, 
additions, or fusion proteins. Such modified polypeptides can show, e.g., enhanced 
activity or increased stability. In addition, they may be purified in higher yields and 
show better solubility than the corresponding natural polypeptide, at least under 
certain purification and storage conditions. 

15 

N-Terminal and C-Terminal Deletion Mutants 

It is known in the art that one or more amino acids may be deleted from the 
N-terminus or C-terminus without substantial loss of biological function. For 
instance, Ron et al. J. Biol. Chem., 268:2984-2988 (1993), reported modified KGF 

20 proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid 
residues were missing. Accordingly, the present invention provides polypeptides 
having one or more residues deleted from the amino terminus of the amino acid 
sequence of the E. faecalis polypeptides shown in Table 1, and polynucleotides 
encoding such polypeptides. 

25 Similarly, many examples of biologically functional C-terminal deletion 

muteins are known. For instance, Interferon gamma shows up to ten times higher 
activities by deleting 8-10 amino acid residues from the carboxy terminus of the 
protein See, e.g., Dobeli, et al. (1988) J. Biotechnology 7:199-216. Accordingly, the 
present invention provides polypeptides having one or more residues from the 
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carboxy terminus of the amino acid sequence of the E.faecalis polypeptides shown in 
Table 1. The invention also provides polypeptides having one or more amino acids 
deleted from both the amino and the carboxyl termini as described below. 

The present invention is further directed to polynucleotide encoding portions 
5 or fragments of the amino acid sequences described herein as well as to portions or 
fragments of the isolated amino acid sequences described herein. Fragments include 
portions of the amino acid sequences of Table 1, are at least 5 contiguous amino acid 
in length, are selected from any two integers, one of which representing a N-terminal 
position. The initiation codon of the polypeptides of the present inventions position 

10 1 . Every combination of a N-terminal and C-terminal position that a fragment at least 
5 contiguous amino acid residues in length could occupy, on any given amino acid 
sequence of Table 1 is included in the invention. At least means a fragment may be 5 
contiguous amino acid residues in length or any integer between 5 and the number of 
residues in a full length amino acid sequence minus 1 . Therefore, included in the 

15 invention are contiguous fragments specified by any N-terminal and C-terminal 

positions of amino acid sequence set forth in Table 1 wherein the contiguous fragment 
is any integer between 5 and the number of residues in a full length sequence minus 1. 

Further, the invention includes polypeptides comprising fragments specified 
by size, in amino acid residues, rather than by N-terminal and C-terminal positions. 

20 The invention includes any fragment size, in contiguous amino acid residues, selected 
from integers between 5 and the number of residues in a full length sequence minus 1 . 
Preferred sizes of contiguous polypeptide fragments include about 5 amino acid 
residues, about 10 amino acid residues, about 20 amino acid residues, about 30 amino 
acid residues, about 40 amino acid residues, about 50 amino acid residues, about 100 

25 amino acid residues, about 200 amino acid residues, about 300 amino acid residues, 
and about 400 amino acid residues. The preferred sizes are, of course, meant to 
exemplify, not limit, the present invention as all size fragments representing any 
integer between 5 and the number of residues in a full length sequence minus 1 are 
included in the invention. The present invention also provides for the exclusion of any 
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fragments specified by N-terminal and C-terminal positions or by size in amino acid 
residues as described above. Any number of fragments specified by N-terminal and 
C-terminal positions or by size in amino acid residues as described above may be 
excluded. 

5 The above fragments need not be active since they would be useful, for 

example, in immunoassays, in epitope mapping, epitope tagging, to generate 
antibodies to a particular portion of the protein, as vaccines, and as molecular weight 
markers. 

10 Other Mutants 

In addition to N- and C-terminal deletion forms of the protein discussed above, 
it also will be recognized by one of ordinary skill in the art that some amino acid 
sequences of the E.faecalis polypeptide can be varied without significant effect of the 
structure or function of the protein. If such differences in sequence are contemplated, 

15 it should be remembered that there will be critical areas on the protein which 
determine activity. 

Thus, the invention further includes variations of the E. faecalis polypeptides 
which show substantial E.faecalis polypeptide activity or which include regions of E. 
faecalis protein such as the protein portions discussed below. Such mutants include 

20 deletions, insertions, inversions, repeats, and type substitutions selected according to 
general rules known in the art so as to have little effect on activity. For example, 
guidance concerning how to make phenotypically silent amino acid substitutions is 
provided. There are two main approaches for studying the tolerance of an amino acid 
sequence to change. See, Bowie, J. U. et al (1990), Science 247:1306-1310. The first 

25 method relies on the process of evolution, in which mutations are either accepted or 
rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections or 
screens to identify sequences that maintain functionality. 

These studies have revealed that proteins are surprisingly tolerant of amino 
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acid substitutions. The studies indicate which amino acid changes are likely to be 
permissive at a certain position of the protein. For example, most buried amino acid 
residues require nonpolar side chains, whereas few features of surface side chains are 
generally conserved. Other such phenotypically silent substitutions are described by 

5 Bowie et al. (supra) and the references cited therein. Typically seen as conservative 
substitutions are the replacements, one for another, among the aliphatic amino acids 
Ala, Val, Leu and He; interchange of the hydroxyl residues Ser and Thr, exchange of 
the acidic residues Asp and Glu, substitution between the amide residues Asn and 
Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic 

10 residues Phe, Tyr. 

Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 
1, or that encoded by the plaimds listed in Table 1, may be: (i) one in which one or 
more of the amino acid residues are substituted with a conserved or non-conserved 
amino acid residue (preferably a conserved amino acid residue) and such substituted 

15 amino acid residue may or may not be one encoded by the genetic code: or (ii) one in 
which one or more of the amino acid residues includes a substituent group: or (iii) one 
in which the E.faecalis polypeptide is fused with another compound, such as a 
compound to increase the half-life of the polypeptide (for example, polyethylene 
glycol): or (iv) one in which the additional amino acids are fused to the above form of 

20 the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory 
sequence or a sequence which is employed for purification of the above form of the 
polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are 
deemed to be within the scope of those skilled in the art from the teachings herein. 

Thus, the E. faecalis polypeptides of the present invention may include one or 

25 more amino acid substitutions, deletions, or additions, either from natural mutations or 
human manipulation. As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding or 
activity of the protein (see Table 3). 

Amino acids in the £. faecalis proteins of the present invention that are 
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essential for function can be identified by methods known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis. See, e.g., Cunningham et al. 
(1989) Science 244:1081-1085. The latter procedure introduces single alanine 
mutations at every residue in the molecule, The resulting mutant molecules are then 
5 tested for biological activity using assays appropriate for measuring the function of 
the particular protein. 

Of special interest are substitutions of charged amino acids with other charged 
or neutral amino acids which may produce proteins with highly desirable improved 
characteristics, such as less aggregation. Aggregation may not only reduce activity but 

10 also be problematic when preparing pharmaceutical formulations, because aggregates 
can be immunogenic. See, e.g., Pinckard et al, (1967) Clin. Exp. Immunol. 2:331-340; 
Robbins, et al., (1987) Diabetes 36:838-845; Cleland, et al., (1993) Crit. Rev. 
Therapeutic Drug Carrier Systems 10:307-377. 

The polypeptides of the present invention are preferably provided in an 

1 5 isolated form, and preferably are substantially purified. A recombinantly produced 
version of the E. faecalis polypeptide can be substantially purified by the one-step 
method described by Smith et al. (1988) Gene 67:31-40. Polypeptides of the 
invention also can be purified from natural or recombinant sources using antibodies 
directed against the polypeptides of the invention in. methods which are well known in 

20 the art of protein purification. 

The invention further provides for isolated E. faecalis polypeptides 
comprising an amino acid sequence selected from the group consisting of: (a) the 
amino acid sequence of a full-length E. faecalis polypeptide having the complete 
amino acid sequence shown in Table 1 ; (b) the amino acid sequence of a full-length E. 

25 faecalis polypeptide having the complete amino acid sequence shown in Table 1 
excepting the N-terminal methionine; (c) the complete amino acid sequence encoded 
by the plaimds listed in Table 1; and (d) the complete amino acid sequence excepting 
the N-terminal methionine encoded by the plaimds listed in Table 1 . The 
polypeptides of the present invention also include polypeptides having an amino acid 
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sequence at least 80% identical, more preferably at least 90% identical, and still more 
preferably 95%, 96%, 97%, 98% or 99% identical to those described in (a), (b), (c), 
and (d) above. 

Further polypeptides of the present invention include polypeptides which 
5 have at least 90% similarity, more preferably at least 95% similarity, and still more 
preferably at least 96%, 97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which 
comprises the amino acid sequence of a E.faecalis polypeptide having an amino acid 
sequence which contains at least one conservative amino acid substitution, but not 

10 more than 50 conservative amino acid substitutions, not more than 40 conservative 
amino acid substitutions, not more than 30 conservative amino acid substitutions, and 
not more than 20 conservative amino acid substitutions. Also provided are 
polypeptides which comprise the amino acid sequence of a E.faecalis polypeptide, 
having at least one, but not more than 1 0, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino 

15 acid substitutions. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 

20 alterations per each 100 amino acids of the query amino acid sequence. In other 

words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a query amino acid sequence, up to 5% of the amino acid residues in the subject 
sequence may be inserted, deleted, (indels) or substituted with another amino acid. 
These alterations of the reference sequence may occur at the amino or carboxy 

25 terminal positions of the reference amino acid sequence or anywhere between those 
terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 
95%, 96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences 



WO 98/50554 



-33- 



PCT/US98/08959 



shown in Table 1 or to the amino acid sequence encoded by the plaimds listed in Table 
1 can be determined conventionally using known computer programs. A preferred 
method for determining the best overall match between a query sequence (a sequence 
of the present invention) and a subject sequence, also referred to as a global sequence 
5 alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence 
alignment the query and subject sequences are both amino acid sequences. The result 
of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch 

l o Penal ty= 1 , Joining Penalty=20, Randomization Group Length=0, Cutoff Score= 1 , 
Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window 
Size=500 or the length of the subject amino acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, the results, in percent identity, 

15 must be manually corrected. This is because the FASTDB program does not account 
for N- and C-terminal truncations of the subject sequence when calculating global 
percent identity. For subject sequences truncated at the N- and C-termini, relative to 
the query sequence, the percent identity is corrected by calculating the number of 
residues of the query sequence that are N- and C-terminal of the subject sequence, 

20 which are not matched/aligned with a corresponding subject residue, as a percent of 
the total bases of the query sequence. Whether a residue is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This final percent 

25 identity score is what is used for the purposes of the present invention. Only 
residues to the N- and C-termini of the subject sequence, which are not 
matched/aligned with the query sequence, are considered for the purposes of manually 
adjusting the percent identity score. That is, only query amino acid residues outside 
the farthest N- and C-terminal residues of the subject sequence. 
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For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not 
match/align with the first 10 residues at the N-terminus. The 10 unpaired residues 
5 represent 10% of the sequence (number of residues at the N- and C- termini not 
matched/total number of residues in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 90 
residues were perfectly matched the final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared with a 100 residue query 

10 sequence. This time the deletions are internal so there are no residues at the N- or C- 
termini of the subject sequence which are not matched/aligned with the query. In this 
case the percent identity calculated by FASTDB is not manually corrected. Once 
again, only residue positions outside the N- and C-terminal ends of the subject 
sequence, as displayed in the FASTDB alignment, which are not matched/aligned 

15 with the query sequence are manually corrected. No other manual corrections are to 
made for the purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they 
have their normal biological activity. This is because even where a particular 
polypeptide molecule does not have biological activity, one of skill in the art would 

20 still know how to use the polypeptide, for instance, as a vaccine or to generate 

antibodies. Other uses of the polypeptides of the present invention that do not have 
E.faecalis activity include, inter alia, as epitope tags, in epitope mapping, and as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods known to those of skill in the art. 

25 As described below, the polypeptides of the present invention can also be 

used to raise polyclonal and monoclonal antibodies, which are useful in assays for 
detecting E.faecalis protein expression or as agonists and antagonists capable of 
enhancing or inhibiting E.faecalis protein function. Further, such polypeptides can be 
used in the yeast two-hybrid system to "capture" E.faecalis protein binding proteins 
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which are also candidate agonists and antagonists according to the present invention. 
See, e.g., Fields et al. (1989) Nature 340:245-246. 

Epitope-Bearing Portions 
5 In another aspect, the invention provides peptides and polypeptides 

comprising epitope-bearing portions of the E. faecalis polypeptides of the present 
invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides 
of the present invention. An "immunogenic epitope" is defined as a part of a protein 
that elicits an antibody response when the whole protein or polypeptide is the 

10 immunogen. These immunogenic epitopes are believed to be confined to a few loci on 
the molecule. On the other hand, a region of a protein molecule to which an antibody 
can bind is defined as an "antigenic determinant" or "antigenic epitope." The number 
of immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, e.g., Geysen, et al, (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002. 

15 Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 
4 only lists amino acid residues comprising epitopes predicted to have the highest 
degree of antigenicity. The polypeptides not listed in Table 4 and portions of 
polypeptides not listed in Table 4 are not considered non-antigenic. This is because 
they may still be antigenic in vivo but merely not recognized as such by the particular 

20 algorithm used. Thus, Table 4 lists the amino acid residues comprising preferred 
antigenic epitopes but not a complete list. Amino acid residues comprising other 
anigenic epitopes may be determined by algorithms similar to the Jameson- Wolf 
analysis or by in vivo testing for an antigenic response using the methods described 
herein or those known in the art. 

25 As to the selection of peptides or polypeptides bearing an antigenic epitope 

(i.e., that contain a region of a protein molecule to which an antibody can bind), it is 
well known in that art that relatively short synthetic peptides that mimic part of a 
protein sequence are routinely capable of eliciting an antiserum that reacts with the 
partially mimicked protein. See, e.g., Sutcliffe, et al., (1983) Science 219:660-666. 
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Peptides capable of eliciting protein-reactive sera are frequently represented in the 
primary sequence of a protein, can be characterized by a set of simple chemical rules, 
and are confined neither to immunodominant regions of intact proteins (i.e., 
immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are 

5 extremely hydrophobic and those of six or fewer residues generally are ineffective at 
inducing antibodies that bind to the mimicked protein; longer, peptides, especially 
those containing proline residues, usually are effective. See, Sutcliffe, et ah, supra, p. 
661. For instance, 18 of 20 peptides designed according to these guidelines, containing 
8-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HA1 

10 polypeptide chain, induced antibodies that reacted with the HA1 protein or intact 
virus; and 12/12 peptides from the MuLV polymerase and 1 8/18 from the rabies 
glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 

15 specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas 
obtained by fusion of spleen cells from donors immunized with an antigen 
epitope-bearing peptide generally secrete antibody reactive with the native protein. 
See Sutcliffe, et al., supra, p. 663. The antibodies raised by antigenic epitope-bearing 
peptides or polypeptides are useful to detect the mimicked protein, and antibodies to 

20 different peptides may be used for tracking the fate of various regions of a protein 
precursor which undergoes post-translational processing. The peptides and 
anti-peptide antibodies may be used in a variety of qualitative or quantitative assays 
for the mimicked protein, for instance in competition assays since it has been shown 
that even short peptides (e.g., about 9 amino acids) can bind and displace the larger 

25 peptides in immunoprecipitation assays. See, e.g., Wilson, et al., (1984) Cell 
37:767-778. The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography 
using methods known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
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designed according to the above guidelines preferably contain a sequence of at least 
seven, more preferably at least nine and most preferably between about 10 to about 
50 amino acids (i.e. any integer between 7 and 50) contained within the amino acid 
sequence of a polypeptide of the invention. However, peptides or polypeptides 

5 comprising a larger portion of an amino acid sequence of a polypeptide of the 
invention, containing about 50 to about 100 amino acids, or any length up to and 
including the entire amino acid sequence of a polypeptide of the invention, also are 
considered epitope-bearing peptides or polypeptides of the invention and also are 
useful for inducing antibodies that react with the mimicked protein. Preferably, the 

10 amino acid sequence of the epitope-bearing peptide is selected to provide substantial 
solubility in aqueous solvents (i.e., the sequence includes relatively hydrophilic 
residues and highly hydrophobic sequences are preferably avoided); and sequences 
containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used 

15 to generate an enterococcal-specific immune response or antibodies include portions of 
the amino acid sequences identified in Table 1. More specifically, Table 4 discloses a 
list of non-limiting residues that are involved in the antigenicity of the epitope-bearing 
fragments of the present invention. Therefore, the present inventions provides for 
isolatd and purified antigenic epitope-bearing fragements of the polypeptides of the 

20 present invention comprising a peptide sequences of Table 4. The antigenic epitope- 
bearing fragments comprising a peptide sequence of Table 4 preferably contain a 
sequence of at least seven, more preferably at least nine and most preferably between 
about 1 0 to about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide 
of the present invention. That is, included in the present invention are antigenic 

25 polypeptides between the integers of 7 and 50 amino acid in length comprising one or 
more of the sequences of Table 4. Therefore, in most cases, the polypeptides of 
Table 4 make up only a portion of the antigenic polypeptide. All combinations of 
sequences between the integers of 7 and 50 amino acid in length comprising one or 
more of the sequences of Table 4 are included. The antigenic epitope-bearing 
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fragements may be specified by either the number of contiguous amino acid residues 
or by specific N- terminal and C-terminal positions as described above for the 
polypeptide fragements of the present invention, wherein the initiation codon is 
residue 1 . Any number of the described antigenic epitope-bearing fragements of the 
5 present invention may also be excluded from the present invention in the same 
manner. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides including 
recombinant means using nucleic acid molecules of the invention. For instance, an 

10 epitope-bearing amino acid sequence of the present invention may be fused to a larger 
polypeptide which acts as a carrier during recombinant production and purification, as 
well as during immunization to produce anti-peptide antibodies. Epitope-bearing 
peptides also may be synthesized using known methods of chemical synthesis. For 
instance, Houghten has described a simple method for synthesis of large numbers of 

15 peptides, such as 10-20 mg of 248 different 13 residue peptides representing single 
amino acid variants of a segment of the HA1 polypeptide which were prepared and 
characterized (by ELISA-type binding studies) in less than four weeks (Houghten, R. 
A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple 
Peptide Synthesis (SMPS)" process is further described in U.S. Patent No. 4,631,21 1 

20 to Houghten and coworkers (1986). In this procedure the individual resins for the 
solid-phase synthesis of various peptides are contained in separate solvent-permeable 
packets, enabling the optimal use of the many identical repetitive steps involved in 
solid-phase methods. A completely manual procedure allows 500-1000 or more 
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. 

25 ScL 82:5131-5135 at 5134. 

Epitope-bearing peptides and polypeptides of the invention are used to induce 
antibodies according to methods well known in the art See, e.g., Sutcliffe, et al., 
supra;; Wilson, et al., supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354. 
Generally, animals may be immunized with free peptide; however, anti-peptide 
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antibody titer may be boosted by coupling of the peptide to a macromolecular carrier, 
such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, peptides 
containing cysteine may be coupled to carrier using a linker such as 
m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may 

5 be coupled to carrier using a more general linking agent such as glutaraldehyde. 
Animals such as rabbits, rats and mice are immunized with either free or 
carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection 
of emulsions containing about 100 \ig peptide or carrier protein and Freund's adjuvant. 
Several booster injections may be needed, for instance, at intervals of about two 

10 weeks, to provide a useful titer of anti-pep tide antibody which can be detected, for 
example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by 
selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a 
solid support and elution of the selected antibodies according to methods well known 

15 in the art. 

Immunogenic epitope-bearing peptides of the invention, le.> those parts of a 
protein that elicit an antibody response when the whole protein is the immunogen, are 
identified according to methods known in the art. For instance, Geysen, et al, supra, 
discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of 

20 peptides of sufficient purity to react in an ELISA. Interaction of synthesized 
peptides with antibodies is then easily detected without removing them from the 
support. In this manner a peptide bearing an immunogenic epitope of a desired 
protein may be identified routinely by one of ordinary skill in the art. For instance, 
the immunologically important epitope in the coat protein of foot-and-mouth disease 

25 virus was located by Geysen et ai supra with a resolution of seven amino acids by 
synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 
213 amino acid sequence of the protein. Then, a complete replacement set of peptides 
in which all 20 amino acids were substituted in turn at every position within the 
epitope were synthesized, and the particular amino acids conferring specificity for the 
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reaction with antibody were determined. Thus, peptide analogs of the epitope-bearing 
peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Geysen (1987) further describes this method of identifying a peptide 
bearing an immunogenic epitope of a desired protein. 

5 Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a.general 

method of detecting or determining the sequence of monomers (amino acids or other 
compounds) which is a topological equivalent of the epitope (i.e., a "mimotope") 
which is complementary to a particular paratope (antigen binding site) of an antibody 
of interest. More generally, U.S. Patent No. 4,433,092, also to Geysen (1989), 

10 describes a method of detecting or determining a sequence of monomers which is a 
topographical equivalent of a ligand which is complementary to the ligand binding site 
of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 to Houghten, 
R. A. et al. (1996) discloses linear CrCy-alkyl peralkylated oligopeptides and sets and 
libraries of such peptides, as well as methods for using such oligopeptide sets and 

15 libraries for determining the sequence of a peralkylated oligopeptide that preferentially 
binds to an acceptor molecule of interest. Thus, non-peptide analogs of the 
epitope-bearing peptides of the invention also can be made routinely by these 
methods. The entire disclosure of each document cited in this section on 
"Polypeptides and Fragments" is hereby incorporated herein by reference. 

20 As one of skill in the art will appreciate, the polypeptides of the present 

invention and the epitope-bearing fragments thereof described above can be combined 
with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric 
polypeptides. These fusion proteins facilitate purification and show an increased 
half-life in vivo. This has been shown, e.g., for chimeric proteins consisting of the 

25 first two domains of the human CD4-polypeptide and various domains of the 

constant regions of the heavy or light chains of mammalian immunoglobulins. (EPA 
0,394,827; Traunecker et al. (1988) Nature 331:84-86. Fusion proteins that have a 
disulfide-linked dimeric structure due to the IgG part can also be more efficient in 
binding and neutralizing other molecules than a monomelic E.faecalis polypeptide or 



WO 98/50554 



-41- 



PCT/US98/08959 



fragment thereof alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964. 
Nucleic acids encoding the above epitopes of E, faecalis polypeptides can also be 
recombined with a gene of interest as an epitope tag to aid in detection and 
purification of the expressed polypeptide. 

5 

Antibodies 

E. faecalis protein-specific antibodies for use in the present invention can be 
raised against the intact E. faecalis protein or an antigenic polypeptide fragment 
thereof, which may be presented together with a carrier protein, such as an albumin, to 

10 an animal system (such as rabbit or mouse) or, if it is long enough (at least about 25 
amino acids), without a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
meant to include intact molecules, single chain whole antibodies, and antibody 
fragments. Antibody fragments of the present invention include Fab and F(ab')2 and 

15 other fragments including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv). 
Also included in the present invention are chimeric and humanized monoclonal 
antibodies and polyclonal antibodies specific for the polypeptides of the present 
invention. The antibodies of the present invention may be prepared by any of a 
variety of methods. For example, cells expressing a polypeptide of the present 

20 invention or an antigenic fragment thereof can be administered to an animal in order to 
induce the production of sera containing polyclonal antibodies. For example, a 
preparation of E. faecalis polypeptide or fragment thereof is prepared and purified to 
render it substantially free of natural contaminants. Such a preparation is then 
introduced into an animal in order to produce polyclonal antisera of greater specific 

25 activity. 

In a preferred method, the antibodies of the present invention are monoclonal 
antibodies or binding fragments thereof. Such monoclonal antibodies can be prepared 
using hybridoma technology. See, e.g., Harlow et al., ANTIBODIES: A 
LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); 
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Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL 
HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and F(ab')2 fragments may be 
produced by proteolytic cleavage, using enzymes such as papain (to produce Fab 
fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, E. faecalis 
5 polypeptide-binding fragments, chimeric, and humanized antibodies can be produced 
through the application of recombinant DNA technology or through synthetic 
chemistry using methods known in the art. 

Alternatively, additional antibodies capable of binding to the polypeptide 
antigen of the present invention may be produced in a two-step procedure through the 

10 use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies 
are themselves antigens, and that, therefore, it is possible to obtain an antibody which 
binds to a second antibody. In accordance with this method, E. faecalis 
polypeptide-specific antibodies are used to immunize an animal, preferably a mouse. 
The splenocytes of such an animal are then used to produce hybridoma cells, and the 

15 hybridoma cells are screened to identify clones which produce an antibody whose 
ability to bind to the E. faecalis polypeptide-specific antibody can be blocked by the 
E.faecalis polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to 
the E.faecalis polypeptide-specific antibody and can be used to immunize an animal 
to induce formation of further E.faecalis polypeptide-specific antibodies. 

20 Antibodies and fragements thereof of the present invention may be described 

by the portion of a polypeptide of the present invention recognized or specifically 
bound by the antibody. Antibody binding fragements of a polypeptide of the present 
invention may be described or specified in the same manner as for polypeptide 
fragements discussed above., i.e, by N-terminal and C-terminal positions or by size in 

25 contiguous amino acid residues. Any number of antibody binding fragments, of a 
polypeptide of the present invention, specified by N-terminal and C-terminal 
positions or by size in amino acid residues, as described above, may also be excluded 
from the present invention. Therefore, the present invention includes antibodies the 
specifically bind a particuarlly discribed fragement of a polypeptide of the present 
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invention and allows for the exclusion of the same. 

Antibodies and fragements thereof of the present invention may also be 
described or specified in terms of their cross-reactivity. Antibodies and fragements 
that do not bind polypeptides of any other species of Enterococcus other than E. 
5 faecalis are included in the present invention. Likewise, antibodies and fragements 
that bind only species of Enterococcus, i.e. antibodies and fragements that do not bind 
bacteria from any genus other than Enterococcus, are included in the present 
invention. 

1 0 Diagnostic Assays 

The present invention further relates to methods for assaying staphylococcal 
infection in an animal by detecting the expression of genes encoding staphylococcal 
polypeptides of the present invention. The methods comprise analyzing tissue or 
body fluid from the animal for Enterococcus-specific antibodies, nucleic acids, or 

15 proteins. Analysis of nucleic acid specific to Enterococcus is assayed by PCR or 
hybridization techniques using nucleic acid sequences of the present invention as 
either hybridization probes or primers. See, e.g., Sambrook et al. Molecular cloning: 
A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 
reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing 

20 differentiation among spotted fever group Rickettsiae species by analysis of restriction 
fragment length polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin. 
Microbiol. 32:589-595 (detecting 5. burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Enterococcus has 
already been made, the present invention is useful for monitoring progression or 

25 regression of the disease state whereby patients exhibiting enhanced Enterococcus 

gene expression will experience a worse clinical outcome relative to patients expressing 
these gene(s) at a lower level. 

By "biological sample" is intended any biological sample obtained from an 
animal, cell line, tissue culture, or other source which contains Enterococcus 
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polypeptide, mRNA, or DNA. Biological samples include body fluids (such as saliva, 
blood, plasma, urine, mucus, synovial fluid, etc.) tissues (such as muscle, skin, and 
cartilage) and any other biological source suspected of containing Enterococcus 
polypeptides or nucleic acids. Methods for obtaining biological samples such as 
5 tissue are well known in the art. 

The present invention is useful for detecting diseases related to Enterococcus 
infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds, 
cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans. 
Total RNA can be isolated from a biological sample using any suitable 

10 technique such as the single-step guamdinium-thiocyanate-phenol -chloroform method 
described in Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding 
Enterococcus polypeptides having sufficient homology to the nucleic acid sequences 
identified in Table 1 to allow for hybridization between complementary sequences are 
then assayed using any appropriate method. These include Northern blot analysis, SI 

15 nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in 

combination with the polymerase chain reaction (RT-PCR), and reverse transcription 
in combination with the ligase chain reaction (RT-LCR). 

Northern blot analysis can be performed as described in Harada et al. (1990) 
Cell 63:303-312. Briefly, total RNA is prepared from a biological sample as described 

20 above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as 
glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel 
electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been 
linked to the filter by a UV linker, the filter is prehybridized in a solution containing 
formamide, SSC, Denhardt's solution, denatured salmon sperm, SDS, and sodium 

25 phosphate buffer. A E. faecalis polynucleotide sequence shown in Table 1 labeled 
according to any appropriate method (such as the 32 P-multiprimed DNA labeling 
system (Amersham)) is used as probe. After hybridization overnight, the filter is 
washed and exposed to x-ray film. DNA for use as probe according to the present 
invention is described in the sections above and will preferably at least 1 5 nucleotides 
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in length. 

SI mapping can be performed as described in Fujita et al. (1987) Cell 
49:357-367. To prepare probe DNA for use in SI mapping, the sense strand of an 
above-described E.faecalis DNA sequence of the present invention is used as a 
5 template to synthesize labeled antisense DNA. The antisense DNA can then be 
digested using an appropriate restriction endonuclease to generate further DNA 
probes of a desired length. Such antisense probes are useful for visualizing protected 
bands corresponding to the target mRNA (/.c, mRNA encoding Enterococcus 
polypeptides). 

10 Levels of mRNA encoding Enterococcus polypeptides are assayed, for e.g., 

using the RT-PCR method described in Makino et al. (1990) Technique 2:295-301. 
By this method, the radioactivities of the "amplicons" in the polyacrylamide gel bands 
are linearly related to the initial concentration of the target mRNA. Briefly, this 
method involves adding total RN A isolated from a biological sample in a reaction 

15 mixture containing a RT primer and appropriate buffer. After incubating for primer 
annealing, the mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase 
inhibitor and reverse transcriptase. After incubation to achieve reverse transcription 
of the RNA, the RT products are then subject to PCR using labeled primers. 
Alternatively, rather than labeling the primers, a labeled dNTP can be included in the 

20 PCR reaction mixture. PCR amplification can be performed in a DNA thermal cycler 
according to conventional techniques. After a suitable number of rounds to achieve 
amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel. 
After drying the gel, the radioactivity of the appropriate bands (corresponding to the 
mRNA encoding the Enterococcus polypeptides of the present invention) are 

25 quantified using an imaging analyzer. RT and PCR reaction ingredients and 

conditions, reagent and gel concentrations, and labeling methods are well known in the 
art. Variations on the RT-PCR method will be apparent to the skilled artisan. Other 
PCR methods that can detect the nucleic acid of the present invention can be found in 
PCR PRIMER: A LABORATORY MANUAL (C.W. Dieffenbach et al. eds., Cold 
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Spring Harbor Lab Press, 1995). 

The polynucleotides of the present invention, including both DNA and RNA, 
may be used to detect polynucleotides of the present invention or Enterococcal 
species including E. faecalis using bio chip technology. The present invention 

5 includes both high density chip arrays (>1000 oligonucleotides per cm 2 ) and low 

density chip arrays (<1000 oligonucleotides per cm 2 ). Bio chips comprising arrays of 
polynucleotides of the present invention may be used to detect Enterococcal species, 
including E. faecalis^ in biological and environmental samples and to diagnose an 
animal, including humans, with an E. faecalis or other Enterococcal infection. The bio 

10 chips of the present invention may comprise polynucleotide sequences of other 

pathogens including bacteria, viral, parasitic, and fungal polynucleotide sequences, in 
addition to the polynucleotide sequences of the present invention, for use in rapid 
diffenertial pathogenic detection and diagnosis. The bio chips can also be used to 
monitor an E. faecalis or other Enterococcal infections and to monitor the genetic 

15 changes (deletions, insertions, mismatches, etc) in response to drug therapy in the 
clinic and drug development in the laboratory. The bio chip technology comprising 
arrays of polynucleotides of the present invention may also be used to simultaneously 
monitor the expression of a multiplicity of genes, including those of the present 
invention. The polynucleotides used to comprise a selected array may be specified in 

20 the same manner as for the fragements, i.e, by their 5' and 3* positions or length in 
contigious base pairs and include from. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcal species, including E. 
faecalis, using bio chip technology include those known in the art and those of: U.S. 
Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 5527681, 

25 5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365, 
WO/951 1995, WO/9743447, WO/9535505, each incorporated herein in their 
entireties. 

Biosensors using the polynucleotides of the present invention may also be 
used to detect, diagnose, and monitor E. faecalis or other Enterococcal species and 
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infections thereof. Biosensors using the polynucleotides of the present invention may 
also be used to detect particular polynucleotides of the present invention. Biosensors 
using the polynucleotides of the present invention may also be used to monitor the 
genetic changes (deletions, insertions, mismatches, etc.) in response to drug therapy in 
5 the clinic and drug development in the laboratory. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcal species, including E. 
faecalis, using biosenors include those known in the art and those of: U.S. Patent Nos 
5721 102, 5658732, 5631 170, and World Patent Nos. WO97/3501 1, WO/9720203, 
each incorporated herein in their entireties. 

10 Thus, the present invention includes both bio chips and biosensors comprising 

polynucleotides of the present invention and methods of their use. 

Assaying Enterococcus polypeptide levels in a biological sample can occur 
using any art-known method, such as antibody-based techniques. For example, 
Enterococcus polypeptide expression in tissues can be studied with classical 

15 immunohistological methods. In these, the specific recognition is provided by the 
primary antibody (polyclonal or monoclonal) but the secondary detection system can 
utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an 
immunohistological staining of tissue section for pathological examination is obtained. 
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of 

20 Enterococcus polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. 
et al. (1985) J. Cell. Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell . Biol. 
105:3087-3096. In this technique, which is based on the use of cationic solid phases, 
quantitation of a Enterococcus polypeptide can be accomplished using an isolated 
Enterococcus polypeptide as a standard. This technique can also be applied to body 

25 fluids. 

Other antibody-based methods useful for detecting Enterococcus polypeptide 
gene expression include immunoassays, such as the ELIS A and the radioimmunoassay 
(RIA). For example, a Enterococcus polypeptide-specific monoclonal antibodies can 
be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and 
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quantify a Enterococcus polypeptide. The amount of a Enterococcus polypeptide 
present in the sample can be calculated by reference to the amount present in a 
standard preparation using a linear regression computer algorithm. Such an ELISA is 
described in lacobelli et al. (1988) Breast Cancer Research and Treatment 1 1:19-30. In 

5 another ELISA assay, two distinct specific monoclonal antibodies can be used to 

detect Enterococcus polypeptides in a body fluid. In this assay, one of the antibodies 
is used as the immunoabsorbent and the other as the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or 
"two-step" assay. The "one-step" assay involves contacting the Enterococcus 

10 polypeptide with immobilized antibody and, without washing, contacting the mixture 
with the labeled antibody. The "two-step" assay involves washing before contacting 
the mixture with the labeled antibody. Other conventional methods may also be 
employed as suitable. It is usually desirable to immobilize one component of the 
assay system on a support, thereby allowing other components of the system to be 

15 brought into contact with the component and readily removed from the sample. 
Variations of the above and other immunological methods included in the present 
invention can also be found in Harlow et al., ANTIBODIES: A LABORATORY 
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). 

Suitable enzyme labels include, for example, those from the oxidase group, 

20 which catalyze the production of hydrogen peroxide by reacting with substrate. 
Glucose oxidase is particularly preferred as it has good stability and its substrate 
(glucose) is readily available. Activity of an oxidase label may be assayed by 
measuring the concentration of hydrogen peroxide formed by the enzyme-labeled 
antibody/substrate reaction. Besides enzymes, other suitable labels include 

25 radioisotopes, such as iodine ( 125 I, l21 I), carbon ( 14 C), sulphur ( 35 S), tritium ( 3 H), 

indium ( U2 In), and technetium (""Tc), and fluorescent labels, such as fluorescein and 
rhodamine, and biotin. 

Further suitable labels for the Enterococcus polypeptide-specific antibodies of 
the present invention are provided below. Examples of suitable enzyme labels include 
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malate dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase, yeast-alcohol 
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, 
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, 
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and 
5 acetylcholine esterase. 

Examples of suitable radioisotopic labels include 3 H, 1 1 'in, l25 1, 131 1, 32 P, 35 S, 
I4 C, 5I Cr, 57 To, 58 Co, 59 Fe, 75 Se, 152 Eu, 90 Y, 67 Cu, 217 Ci, 21 'At, 2I2 Pb, 47 Sc, ,09 Pd, etc. 
111 In is a preferred isotope where in vivo imaging is used since its avoids the problem 
of dehalogenation of the 125 I or 13l I-labeled monoclonal antibody by the liver. In 
10 addition, this radionucleotide has a more favorable gamma emission energy for imaging. 
See, e.g., Perkins et al. (1985) Eur. J. Nucl. Med. 10:296-301; Carasquillo et al. 
(1987) J. Nucl. Med. 28:281-287. For example, U1 ln coupled to monoclonal 
antibodies with 1 -(P-isothiocyanatobenzyl)-DPTA has shown little uptake in 
non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 
15 localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870. 

Examples of suitable non-radioactive isotopic labels include 157 Gd, 55 Mn, 
162 Dy, 52 Tr, and 56 Fe. 

Examples of suitable fluorescent labels include an l52 Eu label, a fluorescein 
label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin 
20 label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label. 

Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, 
ricin, and cholera toxin. 

Examples of chemiluminescent labels include a luminal label, an isoluminal 
label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an 
25 oxalate ester label, a luciferin label, a luciferase label, and an aequorin label 

Examples of nuclear magnetic resonance contrasting agents include heavy metal 
nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are 
provided by Kennedy et al. (1976) Clin. Chim. Acta 70:1-3 1, and Schurs et al. (1977) 
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Clin. Chim. Acta 8 1 : 1-40. Coupling techniques mentioned in the latter are the 
glutaraldehyde method, the periodate method, the dimaleimide method, the 
m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are 
incorporated by reference herein. 
5 In a related aspect, the invention includes a diagnostic kit for use in screening 

serum containing antibodies specific against E. faecalis infection. Such a kit may 
include an isolated E. faecalis antigen comprising an epitope which is specifically 
immunoreactive with at least one anti-is. faecalis antibody. Such a kit also includes 
means for detecting the binding of said antibody to the antigen. In specific 

10 embodiments, the kit may include a recombinantly produced or chemically 

synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen 
may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above-described kit 
includes a solid support to which said peptide or polypeptide antigen is attached. 

15 Such a kit may also include a non-attached reporter-labeled anti-human antibody. In 
this embodiment, binding of the antibody to the E. faecalis antigen can be detected by 
binding of the reporter labeled antibody to the anti-is. faecalis polypeptide antibody. 

In a related aspect, the invention includes a method of detecting E. faecalis 
infection in a subject. This detection method includes reacting a body fluid, preferably 

20 serum, from the subject with an isolated E. faecalis antigen, and examining the antigen 
for the presence of bound antibody. In a specific embodiment, the method includes a 
polypeptide antigen attached to a solid support, and serum is reacted with the 
support. Subsequently, the support is reacted with a reporter-labeled anti-human 
antibody. The support is then examined for the presence of reporter-labeled 

25 antibody. 

The solid surface reagent employed in the above assays and kits is prepared 
by known techniques for attaching protein material to solid support material, such as 
polymeric beads, dip sticks, 96-well plates or filter material. These attachment 
methods generally include non-specific adsorption of the protein to the support or 
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covalent attachment of the protein , typically through a free amine group, to a 
chemically reactive group on the solid support, such as an activated carboxyl, 
hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in 
conjunction with biotinylated antigen(s). 

5 The polypeptides and antibodies of the present invention, including fragments 

thereof, may be used to detect Enterococcal species including E.faecalis using bio chip 
and biosensor technology. Bio chip and biosensors of the present invention may 
comprise the polypeptides of the present invention to detect antibodies, which 
specifically recognize Enterococcal species, including E.faecalis. Bio chip and 

10 biosensors of the present invention may also comprise antibodies which specifically 
recognize the polypeptides of the present invention to detect Enterococcal species, 
including E.faecalis or specific polypeptides of the present invention. Bio chips or 
biosensors comprising polypeptides or antibodies of the present invention may be 
used to detect Enterococcal species, including E.faecalis^ in biological and 

15 environmental samples and to diagnose an animal, including humans, with an E. 

faecalis or other Enterococcal infection. Thus, the present invention includes both bio 
chips and biosensors comprising polypeptides or antibodies of the present invention 
and methods of their use. 

The bio chips of the present invention may further comprise polypeptide 

20 sequences of other pathogens including bacteria, viral, parasitic, and fungal 

polypeptide sequences, in addition to the polypeptide sequences of the present 
invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio 
chips of the present invention may further comprise antibodies or fragements thereof 
specific for other pathogens including bacteria, viral, parasitic, and fungal polypeptide 

25 sequences, in addition to the antibodies or fragements thereof of the present invention, 
for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and 
biosensors of the present invention may also be used to monitor an E.faecalis or other 
Enterococcal infection and to monitor the genetic changes (amio acid deletions, 
insertions, substitutions, etc.) in response to drug therapy in the clinic and drug 
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development in the laboratory. The bio chip and biosensors comprising polypeptides 
or antibodies of the present invention may also be used to simultaneously monitor the 
expression of a multiplicity of polypeptides, including those of the present invention. 
The polypeptides used to comprise a bio chip or biosensor of the present invention 
5 may be specified in the same manner as for the fragements, i.e, by their N-terminal and 
C-terminal positions or length in contigious amino acid residue. Methods and 
* particular uses of the polypeptides and antibodies of the present invention to detect 
Enterococcal species, including E.faecalis, or specific polypeptides using bio chip and 
biosensor technology include those known in the art, those of the U.S. Patent Nos. 
10 and World Patent Nos. listed above for bio chips and biosensors using 

polynucleotides of the present invention, and those of: U.S. Patent Nos. 5658732, 
5135852, 5567301, 5677196, 5690894 and World Patent Nos. W09729366, 
W096 12957, each incorporated herein in their entireties. 

15 Treatment: 

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compounds to identify 
those which enhance or block the biological activity of the E.faecalis polypeptides of 
the present invention. The present invention further provides where the compounds 
20 kill or slow the growth of E.faecalis. The ability of E. faecalis antagonists, including 
E.faecalis ligands, to prophylactically or therapeutically block antibiotic resistance 
may be easily tested by the skilled artisan. See, e.g., Straden et al. (1997) J Bacteriol. 
179(1):9-16. 

An agonist is a compound which increases the natural biological function or 
25 which functions in a manner similar to the polypeptides of the present invention, 
while antagonists decrease or eliminate such functions. Potential antagonists include 
small organic molecules, peptides, polypeptides, and antibodies that bind to a 
polypeptide of the invention and thereby inhibit or extinguish its activity. 

The antagonists may be employed for instance to inhibit peptidoglycan cross 
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bridge formation. Antibodies against Kfaecalis may be employed to bind to and 
inhibit E.faecalis activity to treat antibiotic resistance. Any of the above antagonists 
may be employed in a composition with a pharmaceutical^ acceptable carrier. 

5 Vaccines 

The present invention also provides vaccines comprising one or more 

polypeptides of the present invention. Heterogeneity in the composition of a vaccine 

may be provided by combining E. faecalis polypeptides of the present invention. 

Multi-component vaccines of this type are desirable because they are likely to be 
10 more effective in eliciting protective immune responses against multiple species and 

strains of the Enterococcus genus than single polypeptide vaccines. 

Multi-component vaccines are known in the art to elicit antibody production 

to numerous immunogenic components. See, e.g., Decker et al. (1996) J. Infect. Dis. 

174:S270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent 
15 vaccine has recently been demonstrated to elicit protective levels of antibodies in 

human infants against all four pathogenic agents. See, e.g., Aristegui, J. et al. (1997) 

Vaccine 15:7-9. 

The present invention in addition to single-component vaccines includes 
multi-component vaccines. These vaccines comprise more than one polypeptide, 

20 immunogen or antigen. Thus, a multi-component vaccine would be a vaccine 

comprising more than one of the E.faecalis polypeptides of the present invention. 

Further within the scope of the invention are whole cell and whole viral 
vaccines. Such vaccines may be produced recombinantly and involve the expression 
of one or more of the E. faecalis polypeptides described in Table 1 . For example, the 

25 E.faecalis polypeptides of the present invention may be either secreted or localized 
intracellular, on ihe cell surface, or in the periplasmic space. Further, when a 
recombinant virus is used, the E. faecalis polypeptides of the present invention may, 
for example, be localized in the viral envelope, on the surface of the capsid, or 
internally within the capsid. Whole cells vaccines which employ cells expressing 
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heterologous proteins are known in the art. See, e.g., Robinson, K. et al. (1997) 
Nature Biotech. 15:653-657; Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033; 
Chabalgoity, J. etal. (1997) Infect. Immun. 65:2402-2412. These cells may be 
administered live or may be killed prior to administration. Chabalgoity, J. et al., supra, 

5 for example, report the successful use in mice of a live attenuated Salmonella vaccine 
strain which expresses a portion of a platyhelminth fatty acid-binding protein as a 
fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques known in 
the art by combining one or more E. faecalis polypeptides of the present invention, or 

10 fragments thereof, with additional non-Enterococcal components (e.g., diphtheria 
toxin or tetanus toxin, and/or other compounds known to elicit an immune response). 
Such vaccines are useful for eliciting protective immune responses to both members of 
the Enterococcus genus and non-Enterococcal pathogenic agents. 

The vaccines of the present invention also include DNA vaccines. DNA 

15 vaccines are currently being developed for a number of infectious diseases. See, et al, 
Boyer, et al. (1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine 
14:1285-1288. Such DNA vaccines contain anucleotide sequence encoding one or 
more E. faecalis polypeptides of the present invention oriented in a manner that 
allows for expression of the subject polypeptide. For example, the direct 

20 administration of plasmid DNA encoding B. burgdorgeri OspA has been shown to 
elicit protective immunity in mice against borrelial challenge. See, Luke et al. (1997) J. 
Infect. Dis. 175:91-97. 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim et al. 

25 (1997) Nature Biotech. 1 5:641-646, for example, report the enhancement of immune 
responses produced by DNA immunizations when DNA sequences encoding 
molecules which stimulate the immune response are co-administered. In a similar 
fashion, the vaccines of the present invention may be co-administered with either 
nucleic acids encoding immune modulators or the immune modulators themselves. 
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These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

The vaccines of the present invention may be used to confer resistance to 
Enterococcal infection by either passive or active immunization. When the vaccines of 

5 the present invention are used to confer resistance to Enterococcal infection through 
active immunization, a vaccine of the present invention is administered to an animal to 
elicit a protective immune response which either prevents or attenuates a Enterococcal 
infection. When the vaccines of the present invention are used to confer resistance to 
Enterococcal infection through passive immunization, the vaccine is provided to a host 

10 animal (e.g., human, dog, or mouse), and the antisera elicited by this antisera is 

recovered and directly provided to a recipient suspected of having an infection caused 
by a member of the Enterococcus genus. 

The ability to label antibodies, or fragments of antibodies, with toxin molecules 
provides an additional method for treating Enterococcal infections when passive 

15 immunization is conducted. In this embodiment, antibodies, or fragments of 

antibodies, capable of recognizing the E.faecalis polypeptides disclosed herein, or 
fragments thereof, as well as other Enterococcus proteins, are labeled with toxin 
molecules prior to their administration to the patient. When such toxin derivatized 
antibodies bind to Enterococcus cells, toxin moieties will be localized to these cells and 

20 will cause their death. 

The present invention thus concerns and provides a means for preventing or 
attenuating a Enterococcal infection resulting from organisms which have antigens that 
are recognized and bound by antisera produced in response to the polypeptides of the 
present invention. As used herein, a vaccine is said to prevent or attenuate a disease if 

25 its administration to an animal results either in the total or partial attenuation (i.e., 
suppression) of a symptom or condition of the disease, or in the total or partial 
immunity of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for 
either a "prophylactic" or "therapeutic" purpose. When provided prophylactically, 



WO 98/50554 



-56- 



PCT/US98/08959 



the compound(s) are provided in advance of any symptoms of Enterococcal infection. 
The prophylactic administration of the compound(s) serves to prevent or attenuate 
any subsequent infection. When provided therapeutically, the compound(s) is 
provided upon or after the detection of symptoms which indicate that an animal may 
5 be infected with a member of the Enterococcus genus. The therapeutic administration 
of the compound(s) serves to attenuate any actual infection. Thus, the E. faecalis 
polypeptides, and fragments thereof, of the present invention may be provided either 
prior to the onset of infection (so as to prevent or attenuate an anticipated infection) 
or after the initiation of an actual infection. 

10 The polypeptides of the invention, whether encoding a portion of a native 

protein or a functional derivative thereof, may be administered in pure form or may be 
coupled to a macromolecular carrier. Example of such carriers are proteins and 
carbohydrates. Suitable proteins which may act as macromolecular carrier for 
enhancing the immunogenicity of the polypeptides of the present invention include 

15 keyhole limpet hemacyanin (KLH) tetanus toxoid, pertussis toxin, bovine serum 
albumin, and ovalbumin. Methods for coupling the polypeptides of the present 
invention to such macromolecular carriers are disclosed in Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1988). 

20 A composition is said to be "pharmacologically or physiologically acceptable" 

if its administration can be tolerated by a recipient animal and is otherwise suitable for 
administration to that animal. Such an agent is said to be administered in a 
"therapeutically effective amount" if the amount administered is physiologically 
significant. An agent is physiologically significant if its presence results in a 

25 detectable change in the physiology of a recipient patient. 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that 
the composition of a pharmacologically acceptable compound varies with the animal 
to which it is administered. For example, a vaccine intended for human use will 
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generally not be co-administered with Freund's adjuvant. Further, the level of purity 
of the E. faecalis polypeptides of the present invention will normally be higher when 
administered to a human than when administered to a non-human animal. 

As would be understood by one of ordinary skill in the art, when the vaccine 

5 of the present invention is provided to an animal, it may be in a composition which 
may contain salts, buffers, adjuvants, or other substances which are desirable for 
improving the efficacy of the composition. Adjuvants are substances that can be used 
to specifically augment a specific immune response. These substances generally 
perform two functions: (1) they protect the antigen(s) from being rapidly catabolized 

1 0 after administration and (2) they nonspecifically stimulate immune responses. 

Normally, the adjuvant and the composition are mixed prior to presentation to 
the immune system, or presented separately, but into the same site of the animal being 
immunized. Adjuvants can be loosely divided into several groups based upon their 
composition. These groups include oil adjuvants (for example, Freund's complete and 

15 incomplete), mineral salts (for example, A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , A1NH 4 (S0 4 ), silica, 
kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and 
certain natural substances (for example, wax D from Mycobacterium tuberculosis, as 
well as substances found in Corynebacterium parvum, or Bordetella pertussis, and 
members of the genus Brucella. Other substances useful as adjuvants are the saponins 

20 such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for use in 
the present invention include aluminum salts, such as A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , and 
A1NH 4 (S0 4 ). Examples of materials suitable for use in vaccine compositions are 
provided in REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A. 
Osol, ed, Mack Publishing Co, Easton, PA, (1980) (incorporated herein by reference). 

25 The therapeutic compositions of the present invention can be administered 

parenterally by injection, rapid infusion, nasopharyngeal absorption 
(intranasopharangeally), dermoabsorption, or orally. The compositions may 
alternatively be administered intramuscularly, or intravenously. Compositions for 
parenteral administration include sterile aqueous or non-aqueous solutions, 
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suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such 
as ethyl oleate. Carriers or occlusive dressings can be used to increase skin 
permeability and enhance antigen absorption. Liquid dosage forms for oral 

5 administration may generally comprise a liposome solution containing the liquid 
dosage form. Suitable forms for suspending liposomes include emulsions, suspen- 
sions, solutions, syrups, and elixirs containing inert diluents commonly used in the art, 
such as purified water. Besides the inert diluents, such compositions can also include 
adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, 

1 0 flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
encapsulated form. For example, intranasal immunization using vaccines encapsulated 
in biodegradable microsphere composed of poly(DL-lactide-co-glycolide). See, 
Shahin, R. et al. (1995) Infect. Immun. 63:1 195-1200. Similarly, orally administered 

15 encapsulated Salmonella typhimurium antigens can also be used. Allaoui-Attarki, K. 
et al. (1997) Infect. Immun. 65:853-857. Encapsulated vaccines of the present 
invention can be administered by a variety of routes including those involving 
contacting the vaccine with mucous membranes (e.g., intranasally, intracolonicly, 
intraduodenally). 

20 Many different techniques exist for the timing of the immunizations when a 

multiple administration regimen is utilized. It is possible to use the compositions of 
the invention more than once to increase the levels and diversities of expression of the 
immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple 
immunizations are given, they will be given one to two months apart. 

25 According to the present invention, an "effective amount" of a therapeutic 

composition is one which is sufficient to achieve a desired biological effect. Generally, 
the dosage needed to provide an effective amount of the composition will vary 
depending upon such factors as the animal's or human's age, condition, sex, and extent 
of disease, if any, and other variables which can be adjusted by one of ordinary skill in 
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the art. 

The antigenic preparations of the invention can be administered by either 
single or multiple dosages of an effective amount. Effective amounts of the 
compositions of the invention can vary from 0.01-1,000 \xg/ml per dose, more 
5 preferably 0. 1 -500 \xg/m\ per dose, and most preferably 1 0-300 \xg/m\ per dose. 

Examples 

Example 1: Isolation of a Selected DNA Clone From the Deposited Sample of E. 
faecalis 

10 Three approaches can be used to isolate a E. faecalis clone comprising a 

polynucleotide of the present invention from any E, faecalis genomic DNA library. 
The E. faecalis strain V586 has been deposited as a convienent source for obtaining a 
E. faecalis strain although a wide varity of strains E. faecalis strains can be used which 
are known in the art. 

15 E. faecalis genomic DNA is prepared using the following method. A 20ml 

overnight bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain 
Heart Infusion broth or Super broth), pelleted, washed two times with TES (30mM 
Tris-pH 8.0, 25mM EDTA, 50mM NaCl), and resuspended in 5ml high salt TES 
(2.5M NaCl). Lysostaphin is added to final concentration of approx 50ug/ml and the 

20 mixture is rotated slowly 1 hour at 37C to make protoplast cells. The solution is then 
placed in incubator (or place in a shaking water bath) and warmed to 55C. Five 
hundred micro liter of 20% sarcosyl in TES (final concentration 2%) is then added to 
lyse the cells. Next, guanidine HC1 is added to a final concentration of 7M (3.69g in 
5.5 ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear). 

25 A CsCl gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCl and 
overlaying with 2.85M CsCl. The gradient is carefully overlayed with the DNA- 
containing GuHCl solution. The gradient is spun at 30,000 rpm, 20C for 24 hr and 
the lower DNA band is collected. The volume is increased to 5 ml with TE buffer. 
The DNA is then treated with protease K (10 ug/ml) overnight at 37 C, and 
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precipitated with ethanol. The precipitated DNA is resuspended in a desired buffer. 

In the first method, a plasmid is directly isolated by screening a plasmid E. 
faecalis genomic DNA library using a polynucleotide probe corresponding to a 
polynucleotide of the present invention. Particularly, a specific polynucleotide with 
5 30-40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer 
according to the sequence reported. The oligonucleotide is labeled, for instance, with 
32 P-y-ATP using T4 polynucleotide kinase and purified according to routine methods. 
{See, e.g., Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Press, Cold Spring, NY (1982).) The library is transformed into a suitable 

10 host, as indicated above (such as XL-1 Blue (Stratagene)) using techniques known to 
those of skill in the art. See, e.g., Sambrook et al. MOLECULAR CLONING; A 
LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., 
CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, 
N.Y. 1989). The transformants are plated on 1 .5% agar plates (containing the 

15 appropriate selection agent, e.g., ampicillin) to a density of about 150 transformants 
(colonies) per plate. These plates are screened using Nylon membranes according to 
routine methods for bacterial colony screening. See, e.g., Sambrook et al. 
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, 
N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR 

20 BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of 
skill in the art. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends 
of a polynucleotide of Table 1 are synthesized and used to amplify the desired DNA 
by PCR using a E. faecalis genomic DNA prep as a template. PCR is carried out 
25 under routine conditions, for instance, in 25 fo.1 of reaction mixture with 0.5 ug of the 
above DNA template. A convenient reaction mixture is 1 .5-5 mM MgCl 2 , 0.01% 
(w/v) gelatin, 20 \M each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 
0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94°C for 1 
min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are performed with a 
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Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed by 
agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by 
subcloning and sequencing the DNA product. 
5 Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically 

synthesized and used to generate a nucleotide sequence of desired length using PCR 
methods known in the art. 

Example 2(a): Expression and Purification Enterococcal polypeptides in E. coli 

10 The bacterial expression vector pQE60 was used for bacterial expression of 

some of the polypeptide fragements used in the soft tissue and systemic infection 
models discussed below. (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains a 
bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome binding 

1 5 site ("RBS"), six codons encoding histidine residues that allow affinity purification 
using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) 
and suitable single restriction enzyme cleavage sites. These elements are arranged such 
that an inserted DNA fragment encoding a polypeptide expresses that polypeptide 
with the six His residues (i.e., a "6 X His tag") covalently linked to the carboxyl 

20 terminus of that polypeptide. 

The DNA sequence encoding the desired portion of a E. faecalis protein of the 
present invention was amplified from E. faecalis genomic DNA using PCR 
oligonucleotide primers which anneal to the 5* and 3' sequences coding for the 
portions of the E. faecalis polynucleotide shown in Table 1 . Additional nucleotides 

25 containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' 
and 3 ! sequences, respectively. 

For cloning the mature protein, the 5 ! primer has a sequence containing an 
appropriate restriction site followed by nucleotides of the amino terminal coding 
sequence of the desired E. faecalis polynucleotide sequence in Table 1. One of 
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ordinary skill in the art would appreciate that the point in the protein coding sequence 
where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding 
any desired portion of the complete protein shorter or longer than the mature form. 
The 3 ! primer has a sequence containing an appropriate restriction site followed by 
5 nucleotides complementary to the 3 ! end of the polypeptide coding sequence of Table 
1, excluding a stop codon, with the coding sequence aligned with the restriction site so 
as to maintain its reading frame with that of the six His codons in the pQE60 vector. 

The amplified E.faecalis DNA fragment and the vector pQE60 were digested 
with restriction enzymes which recognize the sites in the primers and the digested 

10 DNAs were then ligated together. The E. faecalis DNA was inserted into the 

restricted pQE60 vector in a manner which places the E. faecalis protein coding region 
downstream from the IPTG-inducible promoter and in- frame with an initiating AUG 
and the six histidine codons. 

The ligation mixture was transformed into competent E. coli cells using 

15 standard procedures such as those described by Sambrook et al., supra,, E. coli strain 
M15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac 
repressor and confers kanamycin resistance ("Kanr"), was used in carrying out the 
illustrative example described herein. This strain, which was only one of many that 
are suitable for expressing a E.faecalis polypeptide, is available commercially 

20 (QIAGEN, Inc., supra), Transformants were identified by their ability to grow on LB 
agar plates in the presence of ampicillin and kanamycin. Plasmid DNA was isolated 
from resistant colonies and the identity of the cloned DNA confirmed by restriction 
analysis, PCR and DNA sequencing. 

Clones containing the desired constructs were grown overnight ("O/N") in 

25 liquid culture in LB media supplemented with both ampicillin (100 |lg/ml) and 
kanamycin (25 |ig/ml). The O/N culture was used to inoculate a large culture, at a 
dilution of approximately 1 :25 to 1:250. The cells were grown to an optical density at 
600 ran ( M OD600 H ) of between 0.4 and 0.6. Isopropyl-p-D-thiogalactopyranoside 
("IPTG") was then added to a final concentration of 1 mM to induce transcription 
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from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequently were incubated further for 3 to 4 hours. Cells then were harvested by 
centrifugation. 

The cells were then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH 8. 
5 The cell debris was removed by centrifugation, and the supernatant containing the E. 
faecalis polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") 
affinity resin column (Q1AGEN, Inc., supra). Proteins with a 6 x His tag bind to the 
Ni-NTA resin with high affinity were purified in a simple one-step procedure (for 
details see: The QIAexpressionist, 1995, QIAGEN, Inc., supra). Briefly the 
10 supernatant was loaded onto the column in 6 M guanidine-HCl, pH 8, the column was 
first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10 
volumes of 6 M guanidine-HCl pH 6, and finally the E. faecalis polypeptide was 
eluted with 6 M guanidine-HCl, pH 5. 

The purified protein was then renatured by dialyzing it against 
15 phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM 
NaCL Alternatively, the protein could be successfully refolded while immobilized on 
the Ni-NTA column. The recommended conditions are as follows: renature using a 
linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
containing protease inhibitors. The renaturation should be performed over a period of 
20 1.5 hours or more. After renaturation the proteins can be eluted by the addition of 
250 mM immidazole. Immidazole was removed by a final dialyzing step against PBS 
or 50 mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein was 
stored at 4° C or frozen at -80° C. 

Some of the polypeptide of the present invention were prepared using a non- 
25 denaturing protein purification method. For these polypeptides, the cell pellet from 
each liter of culture was resuspended in 25 mis of Lysis Buffer A at 4°C (Lysis Buffer 
A = 50 mM Na-phosphate, 300 mM NaCl, 10 mM 2-mercaptoethanol, 10% 
Glycerol, pH 7.5 with 1 tablet of Complete EDTA-free protease inhibitor cocktail 
(Boehringer Mannheim #1873580) per 50 ml of buffer). Absorbance at 550 tun was 
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approximately 10-20 O.D./ml. The suspension was then put through three 
freeze/thaw cycles from -70°C (using a ethanol-dry ice bath) up to room temperature. 
The cells were lysed via sonication in short 10 sec bursts over 3 minutes at 
approximately 80W while kept on ice. The sonicated sample was then centrifuged at 
5 15,000 RPM for 30 minutes at 4°C. The supernatant was passed through a column 
containing 1 .0 ml of CL-4B resin to pre-clear the sample of any proteins that may 
bind to agarose non-specifically, and the flow-through fraction was collected. 

The pre-cleared flow-through was applied to a nickel-nitrilo-tri-acetic acid 
("Ni-NTA") affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag 

10 bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step 
procedure. Briefly, the supernatant was loaded onto the column in Lysis Buffer A at 
4°C, the column was first washed with 10 volumes of Lysis Buffer A until the A280 
of the eluate returns to the baseline. Then, the column was washed with 5 volumes of 
40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) (Buffer B = 50 mM Na- 

15 Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 500 mM 
Imidazole, pH of the final buffer should be 7.5). The protein was eluted off of the 
column with a series of increasing Imidazole solutions made by adjusting the ratios of 
Lysis Buffer A to Buffer B. Three different concentrations were used: 3 volumes of 
75 mM Imidazole, 3 volumes of 150 mM Imidazole, 5 volumes of 500 mM 

20 Imidazole. The fractions containing the purified protein were analyzed using 8 %, 10 
% or 14% SDS-PAGE depending on the protein size. The purified protein was then 
dialyzed 2X against phosphate-buffered saline (PBS) in order to place it into an easily 
workable buffer. The purified protein was stored at 4° C or frozen at -80°. 

The following alternative method may be used to purify Kfaecalis expressed 

25 in E coli when it is present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-1 0°C and the cells are harvested by continuous centrifiigation at 
1 5,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per 
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unit weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 
5 The cells are then lysed by passing the solution through a microfluidizer 

(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 1 5 min. The resultant pellet is washed again using 0.5M 
NaCl, 1 00 mM Tris, 50 mM EDTA, pH 7.4. 

10 The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 1 5 min., the 
pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated 
at 4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 

15 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM 
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C 
without mixing for 12 hours prior to further purification steps. 

To clarify the refolded E.faecalis polypeptide solution, a previously prepared 

20 tangential filtration unit equipped with 0.16 \im membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros HS- 
50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same 

25 buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is 

continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the E.faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 
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Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins. 
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is 
then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 
5 mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. 
Fractions are collected under constant A 2 8o monitoring of the effluent. Fractions 
containing the E. faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) 
are then pooled. 

The resultant £. faecalis polypeptide exhibits greater than 95% purity after 
l o the above refolding and purification steps. No major contaminant bands are observed 
from Commassie blue stained 16% SDS-PAGE gel when 5 \xg of purified protein is 
loaded. The purified protein is also tested for endotoxin/LPS contamination, and 
typically the LPS content is less than 0.1 ng/ml according to LAL assays. 

1 5 * Example 2(b): Alternative Expression and Purification Enterococcal polypeptides in E. 
coli 

Tthe vector pQElO was alternatively used to clone and express some of the 
polypeptides of the present invention for use in the soft tissue and systemic infection 
models discussed below. The difference being such that an inserted DNA fragment 

20 encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a H 6 
X His tag 1 ') covalently linked to the amino terminus of that polypeptide. The bacterial 
expression vector pQElO (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311) was used in this example . The components of the pQE 1 0 plasmid are 
arranged such that the inserted DNA sequence encoding a polypeptide of the present 

25 invention expresses the polypeptide with the six His residues (i.e., a "6 X His tag")) 
covalently linked to the amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of Table 
1 were amplified using PCR oligonucleotide primers from genomic E. faecalis DNA. 
The PCR primers anneal to the nucleotide sequences encoding the desired amino acid 
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sequence of a polypeptide of the present invention. Additional nucleotides containing 
restriction sites to facilitate cloning in the pQElO vector were added to the 5' and 3' 
primer sequences, respectively. 

For cloning a polypeptide of the present invention, the 5' and 3' primers were 
5 selected to amplify their respective nucleotide coding sequences. One of ordinary skill 
in the art would appreciate that the point in the protein coding sequence where the 5' 
and 3' primers begins may be varied to amplify a DNA segment encoding any desired 
portion of a polypeptide of the present invention. The 5' primer was designed so the 
coding sequence of the 6 X His tag is aligned with the restriction site so as to maintain 
10 its reading frame with that of E.faecalis polypeptide. The 3 ! was designed to include 
an stop codon. The amplified DNA fragment was then cloned, and the protein 
expressed, as described above for the pQE60 plasmid. 

The DNA sequences encoding the amino acid sequences of Table 1 may also 
be cloned and expressed as fusion proteins by a protocol similar to that described 
15 directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, 
Madison, WI 5371 1) is preferentially used in place of pQElO. 

The above methods are not limited to the polypeptide fragements actually 
produced. The above method, like the methods below, can be used to produce either 
full length polypeptides or desired fragements therof. 

20 

Example 2(c): Alternative Expression and Purification of Enterococcal polypeptides 
in E. coli 

The bacterial expression vector pQE60 is used for bacterial expression in this 
example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). However, in 
25 this example, the polypeptide coding sequence is inserted such that translation of the 
six His codons is prevented and, therefore, the polypeptide is produced with no 6 X 
His tag. 

The DNA sequence encoding the desired portion of the E.faecalis amino acid 
sequence is amplified from an E.faecalis genomic DNA prep the deposited DNA 



WO 98/50554 



-68- 



PCT/US98/08959 



clones using PCR oligonucleotide primers which anneal to the 5' and 3' nucleotide 
sequences corresponding to the desired portion of the E. faecalis polypeptides. 
Additional nucleotides containing restriction sites to facilitate cloning in the pQE60 
vector are added to the 5' and 3' primer sequences. 

5 For cloning a E. faecalis polypeptides of the present invention, 5' and 3 ' 

primers are selected to amplify their respective nucleotide coding sequences. One of 
ordinary skill in the art would appreciate that the point in the protein coding sequence 
where the 5* and 3' primers begin may be varied to amplify a DNA segment encoding 
any desired portion of a polypeptide of the present invention. The 3' and 5' primers 

10 contain appropriate restriction sites followed by nucleotides complementary to the 5' 
and 3' ends of the coding sequence respectively. The 3 f primer is additionally designed 
to include an in-frame stop codon. 

The amplified E. faecalis DNA fragments and the vector pQE60 are digested 
with restriction enzymes recognizing the sites in the primers and the digested DNAs 

1 5 are then ligated together. Insertion of the E. faecalis DNA into the restricted pQE60 
vector places the E. faecalis protein coding region including its associated stop codon 
downstream from the IPTG-inducible promoter and in-frame with an initiating AUG. 
The associated stop codon prevents translation of the six histidine codons 
downstream of the insertion point. 

20 The ligation mixture is transformed into competent E. coli cells using standard 

procedures such as those described by Sambrook et al. E. coli strain M15/rep4, 
containing multiple copies of the plasmid pREP4, which expresses the lac repressor 
and confers kanamycin resistance ("Kanr"), is used in carrying out the illustrative 
example described herein. This strain, which is only one of many that are suitable for 

25 expressing E. faecalis polypeptide, is available commercially (Q1AGEN, Inc., supra). 
Transformants are identified by their ability to grow on LB plates in the presence of 
ampicillin and kanamycin. Plasmid DNA is isolated from resistant colonies and the 
identity of the cloned DNA confirmed by restriction analysis, PCR and DNA 
sequencing. 
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Clones containing the desired constructs are grown overnight ("O/N") in liquid 
culture in LB media supplemented with both ampicillin (100 Mfi/ml) and kanamycin 
(25 p.g/ml). The O/N culture is used to inoculate a large culture, at a dilution of 
approximately 1 :25 to 1 :250. The cells are grown to an optical density at 600 nm 

5 ("OD600") of between 0.4 and 0.6. isopropyl-b-D-thiogalactopyranoside ("IPTG") is 
then added to a final concentration of 1 mM to induce transcription from the lac 
repressor sensitive promoter, by inactivating the lacl repressor. Cells subsequently 
are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the E. faecalis polypeptide, the cells are then stirred for 3-4 hours at 

10 4°C in 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the 
supernatant containing the E. faecalis polypeptide is dialyzed against 50 mM Na- 
acetate buffer pH 6, supplemented with 200 mM NaCl. Alternatively, the protein 
can be successfully refolded by dialyzing it against 500 mM NaCl, 20% glycerol, 25 
mM Tris/HCl pH 7.4, containing protease inhibitors. After renaturation the protein 

1 5 can be purified by ion exchange, hydrophobic interaction and size exclusion 

chromatography. Alternatively, an affinity chromatography step such as an antibody 
column can be used to obtain pure E. faecalis polypeptide. The purified protein is 
stored at 4° C or frozen at -80° C . 

The following alternative method may be used to purify E. faecalis 

20 polypeptides expressed in E coli when it is present in the form of inclusion bodies. 
Unless otherwise specified, all of the following steps are conducted at 4-1 0°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per 

25 unit weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 1 00 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells ware then lysed by passing the solution through a microfluidizer 
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(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 
5 The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the E. faecalis polypeptide-containing supernatant is incubated 
at 4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 

10 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM 
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C 
without mixing for 1 2 hours prior to further purification steps. 

To clarify the refolded E. faecalis polypeptide solution, a previously prepared 

15 tangential filtration unit equipped with 0, 1 6 (im membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros HS- 
50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same 

20 buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is 

continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the E. faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 

25 Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins. 
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is 
then eluted using a 1 0 column volume linear gradient ranging from 0.2 M NaCl, 50 
mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. 
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Fractions are collected under constant A 2 go monitoring of the effluent. Fractions 
containing the E.faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) 
are then pooled. 

The resultant E. faecalis polypeptide exhibits greater than 95% purity after 
5 the above refolding and purification steps. No major contaminant bands are observed 
from Commassie blue stained 16% SDS-PAGE gel when 5 |ig of purified protein is 
loaded. The purified protein is also tested for endotoxin/LPS contamination, and 
typically the LPS content is less than 0. 1 ng/ml according to LAL assays. 

1 0 Example 2(d): Cloning and Expression of E. faecalis in Other Bacteria 

E.faecalis polypeptides can also be produced in: E.faecalis using the methods 
of S. Skinner et aL, (1988) Mol. Microbiol. 2:289-297 or J. I. Moreno (1996) Protein 
Expr. Purif. 8(3):332-340; Lactobacillus using the methods of C. Rush et aL, 1997 
Appl. Microbiol. Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods 

15 Chang et al., U.S. Patent No. 4,952,508. 

Example 3: Cloning and Expression in COS Cells 

A E.faecalis expression plasmid is made by cloning a portion of the DNA 
encoding a E. faecalis polypeptide into the expression vector pDNAI/Amp or 

20 pDNAIH (which can be obtained from Invitrogen, Inc.). The expression vector 

pDNAI/amp contains: (1) an £. coli origin of replication effective for propagation in 
E. coli and other prokaiyotic cells; (2) an ampicillin resistance gene for selection of 
plasmid-containing prokaiyotic cells; (3) an SV40 origin of replication for propagation 
in eukaryotic cells; (4) a CMV promoter, a polylinker, an SV40 intron; (5) several 

25 codons encoding a hemagglutinin fragment (i.e., an "HA" tag to facilitate purification) 
followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and 
operably linked to the SV40 intron and the polyadenylation signal by means of 
restriction sites in the polylinker. The HA tag corresponds to an epitope derived 
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from the influenza hemagglutinin protein described by Wilson et al. 1984 Cell 37:767. 
The fusion of the HA tag to the target protein allows easy detection and recovery of 
the recombinant protein with an antibody that recognizes the HA epitope. pDNAIIl 
contains, in addition, the selectable neomycin marker. 
5 A DNA fragment encoding a E. faecalis polypeptide is cloned into the 

poly linker region of the vector so that recombinant protein expression is directed by 
the CMV promoter. The plasmid construction strategy is as follows. The DNA from 
a E. faecalis genomic DNA prep is amplified using primers that contain convenient 
restriction sites, much as described above for construction of vectors for expression of 

1 0 E. faecalis in E. coli. The 5 1 primer contains a Kozak sequence, an AUG start codon, 
and nucleotides of the 5' coding region of the E. faecalis polypeptide. The 3 1 primer, 
contains nucleotides complementary to the 3 f coding sequence of the E. faecalis DNA, 
a stop codon, and a convenient restriction site. 

The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested 

15 with appropriate restriction enzymes and then ligated. The ligation mixture is 
transformed into an appropriate E. coli strain such as SURE™ (Stxatagene Cloning 
Systems, La Jolla, CA 92037), and the transformed culture is plated on ampicillin 
media plates which then are incubated to allow growth of ampicillin resistant colonies. 
Plasmid DNA is isolated from resistant colonies and examined by restriction analysis 

20 or other means for the presence of the fragment encoding the E. faecalis polypeptide 
For expression of a recombinant E. faecalis polypeptide, COS cells are 
transfected with an expression vector, as described above, using DEAE-dextran, as 
described, for instance, by Sambrook et al. (supra). Cells are incubated under 
conditions for expression of E faecalis by the vector. 

25 Expression of the E.faecalis-HA fusion protein is detected by radiolabeling 

and immunoprecipitation, using methods described in, for example Harlow et al., 
supra.. To this end, two days after transfection, the cells are labeled by incubation in 
media containing 35 S-cysteine for 8 hours. The cells and the media are collected, and 
the cells are washed and the lysed with detergent-containing RJPA buffer: 1 50 mM 
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NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRIS, pH 7.5, as 
described by Wilson et al. {supra ). Proteins are precipitated from the cell lysate and 
from the culture media using an HA-specific monoclonal antibody. The precipitated 
proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
5 product of the expected size is seen in die cell lysate, which is not seen in negative 
controls. 

Example 4: Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of E.faecalis polypeptide in this 

10 example. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession 
No. 37146). The plasmid contains the mouse DHFR gene under control of the SV40 
early promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate 
activity that are transfected with these plasmids can be selected by growing the cells 
in a selective medium (alpha minus MEM, Life Technologies) supplemented with the 

1 5 chemotherapeutic agent methotrexate. The amplification of the DHFR genes in cells 
resistant to methotrexate (MTX) has been well documented. See, e.g., Alt et al., 
1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. Acta, 
1097:107-143; Page et al., 1991, Biotechnology 9:64-68. Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the target 

20 enzyme, DHFR, as a result of amplification of the DHFR gene. If a second gene is 
linked to the DHFR gene, it is usually co-amplified and over-expressed. It is known 
in the art that this approach may be used to develop cell lines carrying more than 
1 ,000 copies of the amplified gene(s). Subsequently, when the methotrexate is 
withdrawn, cell lines are obtained which contain the amplified gene integrated into one 

25 or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) 
of the Rouse Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. 
(1985) Mol. Cell. Biol. 5:438-447; plus a fragment isolated from the enhancer of the 
immediate early gene of human cytomegalovirus (CMV), Boshart, et al., 1985, Cell 
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41:521-530. Downstream of the promoter are the following single restriction enzyme 
cleavage sites that allow the integration of the genes: Bam HI, Xba 1, and Asp 718. 
Behind these cloning sites the plasrnid contains the 3* intron and poly adenylation site 
of the rat preproinsulin gene. Other high efficiency promoters can also be used for the 

5 expression, e.g., the human fl-actin promoter, the SV40 early or late promoters or the 
long terminal repeats from other retroviruses, e.g., HIV and HTLVI. Clontech's Tet- 
Off and Tet-On gene expression systems and similar systems can be used to express 
the E. faecalis polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, 
Proc. Natl. Acad. Sci. USA 89:5547-5551. For the polyadenylation of the mRNA 

10 other signals, e.g., from the human growth hormone or globin genes can be used as 
well. Stable cell lines carrying a gene of interest integrated into the chromosomes can 
also be selected upon co-transfection with a selectable marker such as gpt, G418 or 
hygromycin. It is advantageous to use more than one selectable marker in the 
beginning, e.g., G4 18 plus methotrexate. 

15 The plasrnid pC4 is digested with the restriction enzymes and then 

dephosphorylated using calf intestinal phosphates by procedures known in the art. 
The vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E. 
faecalis polypeptide is amplified using PCR oligonucleotide primers corresponding to 
the 5' and 3' sequences of the desired portion of the gene. A 5 ! primer containing a 

20 restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the 5' 
coding region of the E. faecalis polypeptide is synthesized and used. A 3' primer, 
containing a restriction site, stop codon, and nucleotides complementary to the 3 ? 
coding sequence of the E. faecalis polypeptides is synthesized and used. The 
amplified fragment is digested with the restriction endonucleases and then purified 

25 again on a 1 % agarose gel. The isolated fragment and the dephosphorylated vector are 
then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasrnid 
pC4 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
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transfection. Five ng of the expression plasmid pC4 is cotransfected with 0.5 \ig of 
the plasmid pSVneo using a lipid-mediated transfection agent such as Lipofectin™ or 
LipofectAMINE.™ (LifeTechnologies Gaithersburg, MD). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 

5 that confers resistance to a group of antibiotics including G41 8. The cells are seeded 
in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 

10 petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 

100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 pM, 2 ^lM, 5 jlM, 10 mM, 20 mM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 

15 100-200 |lM. Expression of the desired gene product is analyzed, for instance, by 
SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

Example 5: Quantitative Murine Soft Tissue Infection Model for E.faecalis 

Compositions of the present invention, including polypeptides and peptides, 

20 are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E.faecalis) using the following quantitative 
murine soft tissue infection model. Mice (e.g., NIH Swiss female mice, approximately 
7 weeks old) are first treated with a biologically protective effective amount, or 
immune enhancing/stimulating effective amount of a composition of the present 

25 invention using methods known in the art, such as those discussed above. See,e.g., 
Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug 
per animal. 
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The desired bacterial species used to challenge the mice, such as E.faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10 8 
cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1 :2 with sterilized Cytodex 3 microcarrier beads preswollen 

5 in sterile PBS (3g/l 00ml). Mice are anesthetize briefly until docile, but still mobile 
and injected with 0.2 ml of the Cytodex 3 bead/bacterial mixture into each animal 
subcutaneously in the inguinal region. After four days, counting the day of injection 
as day one, mice are sacrificed and the contents of the abscess is excised and placed in 
a 15 ml conical tube containing 1.0ml of sterile PBS. The contents of the abscess is 

10 then enzymatically treated and plated as follows. 

The abscess is first disrupted by voitexing with sterilized glass beads placed in 
the tubes. 3.0mls of prepared enzyme mixture (1 .0ml Collagenase D (4.0 mg/ml), 
1 .0ml Trypsin (6.0 mg/ml) and 8.0 mis PBS) is then added to each tube followed by a 
20 min. incubation at 37C. The solution is then centrifuged and the supernatant 

15 drawn off. 0.5 ml dH20 is then added and the tubes are vortexed and then incubated 
for 10 min. at room temperature. 0.5 ml media is then added and samples are serially 
diluted and plated onto agar plates, and grown overnight at 37C. Plates with distinct 
and separate colonies are then counted, compared to positive and negative control 
samples, and quantified. The method can be used to identify composition and 

20 determine appropriate and effective doses for humans and other animals by comparing 
the effective doses of compositions of the present invention with compositions 
known in the art to be effective in both mice and humans. Doses for the effective 
treatment of humans and other animals, using compositions of the present invention, 
are extrapolated using the data from the above experiments of mice. It is appreciated 

25 that further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 

Example 6: Murine Systemic Neutropenic Model for E, faecalis Infection 
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Compositions of the present invention, including polypeptides and peptides, 
are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E.faecalis) using the following qualitative murine 
systemic neutropenic model. Mice (e.g., NIH Swiss female mice, approximately 7 
5 weeks old) are first treated with a biologically protective effective amount, or immune 
enhancing/stimulating effective amount of a composition of the present invention 
using methods known in the art, such as those discussed above. See,e.g., Harlow et 
al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug per animal. 

10 Mice are then injected with 250 - 300 mg/kg cyclophosphamide intraperitonially. 

Counting the day of CP. injection as day one, the mice are left untreated for 5 days to 
begin recovery of PMNUS. 

The desired bacterial species used to challenge the mice, such as E.faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10 8 

15 cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1:2 in 4% Brewer's yeast in media. 
Mice are injected with the bacteria/brewer's yeast challenge intraperitonially. The 
Brewer's yeast solution alone is used as a control. The mice are then monitered twice 
daily for the first week following challenge, and once a day for the next week to 

20 ascertain morbidity and mortality. Mice remaining at the end of the experiment are 
sacrificed. The method can be used to identify compositions and determine 
appropriate and effective doses for humans and other animals by comparing the 
effective doses of compositions of the present invention with compositions known in 
the art to be effective in both mice and humans. Doses for the effective treatment of 

25 humans and other animals, using compositions of the present invention, are 

extrapolated using the data from the above experiments of mice. It is appreciated that 
further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 
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The disclosure of all publications (including patents, patent applications, 
journal articles, laboratory manuals, books, or other documents) cited herein are 
hereby incorporated by reference in their entireties. 

The present invention is not to be limited in scope by the specific 
5 embodiments described herein, which are intended as single illustrations of individual 
aspects of the invention. Functionally equivalent methods and components are within 
the scope of the invention, in addition to those shown and described herein and will 
become apparant to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of 
10 the appended claims. 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 
EF001-1 (SEQ ID NO:l) 



TGAAAGAATA TTGCCAGAAC GTGGCGAGCA 
AAAAAAATGA AGTTCAAAAC TCTAGCAACA 
TTGGGGGCTT GTGGTAACGG TAATGGGGCC 
AAGGAAGATA CGACAATCAC TTTCTGGCAT 
ACAAAATTAA CGAAAGACTT CATGAAAGAA 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA 
TTACCAACAA TTACGCAAGC GTACCCAGGC 
TTAGTGGACT TAAAACCATA TATGGATGAT 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA 
AATAAATCGA CAGAAATGTT ATTCTATAAT 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG 
AACAAAGGCG TTGATTTTAA TAAAGACTTA 
GTGGACTATT ACCGTGATGG TATCGAAGCA 
TATTTATCTG GCCCATTTGC AAACAAAAAG 
GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA 
GAACAACGGA CAGCGGCATT TGAATTCATG 
TACTGGGCAC AACAAACAGG TTATATGCCA 
TACAAAAATT CTAAGACAAC CAAAGTACCT 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT 
GAAAGTATTT TTGCTTCATC AAATAAAGAC 
CAATTTGAAC AAGCATGGAA CCAATAA 



AATTGTTTTA TAAATTTTTT TAAGGGAGAG 
ACAGTGTTAG CAACCGCAGC TATTTTCGCA 
AAAGAATCAA ACGATATTGT GAAAGAAGTG 
GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TGGTTATGGA ATGCTGCACA AGATGAAATG 
GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATCGACGGCA AACAATACGG CATTCCATTT 
GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GCTTCTAAAA CAATTTACGA AAAATCCAAC 
TTAAATAACT ATTACGCAAT TGGAATGAAA 
GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GGTTACTTCC GCACAGCTGG TTCAGATAAA 
GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
GGTGGCTATG AATACGGTGT TGCACCACGT 
GATATTTATA TGTTCGATAG TGCTACGCCA 
AAATTCTTAG CTACTCCTGA TTCACAATTG 
ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
GCACAACTTG AAAACGCAGT AAAAGATTTA 
TCAGCCTATA ATGAAATGCG GACAATTATG 
ACGAGAAAAT TATTGAAAGA TGCAACATCA 



EF001-2 {SEQ ID NO:2) 

MKFKTLATT VLATAAIFAL GACGNGNGAK ESNDIVKEVK 

EDTTITFWHA MNGVQEEALT KLTKDFMKEN PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD TIGWKDAEPI REVLLDGAKI DGKQYGIPFN 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA SKTIYEKSNK EWGAGFDSL NNYYAIGMKN 
KGVDFNKDLD LTSKDSQEW DYYRDGIEAG YFRTAGSDKY LSGPFANKKV AMFVGSIAGA 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD IYMFDSATPE QRTAAFEFMK FLATPDSQLY 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA QLENAVKDLF AIP VEEN ADS AYNEMRTIME 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



EF001-3 (SEQ ID NO:3) 



TT GTGGTAACGG TAATGGGGCC AAAGAAT* 
AAGGAAGATA CGACAATCAC TTTCTGGCAT 
ACAAAATTAA CGAAAGACTT CATGAAAGAA 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA 
TTACCAACAA TTACGCAAGC GTACCCAGGC 
TTAGTGGACT TAAAACCATA TATGGATGAT 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA 
AATAAATCGA CAGAAATGTT ATTCTATAAT 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG 
AACAAAGGCG TTGATTTTAA TAAAGACTTA 
GTGGACTATT ACCGTGATGG TATCGAAGCA 
TATTTATCTG GCCCATTTGC AAACAAAAAG 



!AA ACGATATTGT GAAAGAAGTG 
GCAATGAATG -GGGTTCAAGA AGAAGCGTTA 
AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TGGTTATGGA ATGCTGCACA AGATGAAATG 
GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATCGACGGCA AACAATACGG CATTCCATTT 
GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GCTTCTAAAA CAATTTACGA AAAATCCAAC 
TTAAATAACT ATTACGCAAT TGGAATGAAA 
GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GGTTACTTCC GCACAGCTGG TTCAGATAAA 
GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT GGTGGCTATG AATACGGTGT TGCACCACGT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA GATATTTATA TGTTCGATAG TGCTACGCCA 
GAACAACGGA CAGCGGCATT TGAATTCATG AAATTCTTAG CTACTCCTGA TTCACAATTG 
TACTGGGCAC AACAAACAGG TTATATGCCA ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
TACAAAAATT CTAAGACAAC CAAAGTACCT GCACAACTTG AAAACGCAGT AAAAGATTTA 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT TCAGCCTATA ATGAAATGCG GACAATTATG 
GAAAGTATTT TTGCTTCATC AAATAAAGAC ACGAGAAAAT TATTGAAAGA TGCAACATCA 
CAATTTGAAC AAGCATGGAA CCAA 



EF001-4 (SEQ ID NO:4) 
CGNGNGAK ESNDIVKEVK 

EDTTITFWHA MNGVQEEALT KLTKDFMKEN PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD TIGWKDAEPI REVLLDGAKI DGKQYGIPFN 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA SKTIYEKSNK EWGAGFDSL NNYYAIGMKN 
KGVDFNKDLD LTSKDSQEW DYYRDGIEAG YFRTAGSDKY LSGPFANKKV AMFVGSIAGA 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD IYMFDSATPE QRTAAFEFMK FLATPDSQLY 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA QLENAVKDLF AIPVEENADS AYNEMRTIME 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



EF002-1 {SEQ ID NO:5) 

TAAATAGCGG AGGTAGTACA AATGAAATTT TGGAAAAAAG GCTTAACAGC GGC AGCGCTG 
TTAGCAGTGG CGGCAGTAAC TTTAACAGCA TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 
GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA CTTGGAATTA TGACACGACC 
CCAGAATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA ATCCTGATAT CACTATTGAA 
CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA CAACGATGCT TTCATCAGGA 
GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG ATATCGAACC TGCCAAAGCA 
AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC AGCCTTACCG TACAGATTTC 
TGGGTATTGT ATTACAATAA AAAAATGTTT GATGAAGCCG GAATTGCCTA TCCCGATAAC 
TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT CTAAACCAGA AGAACAAGTA 
TATGGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA CTTATTATGA TCGCGCATTG 
AGAATGCAAA AAGATCAATC ACAAATGGAT TTTGGAACAG CAAAATCAAC AAAAGTAACG 
TATCAATCAC AATTTGAAAA TTCAAAAGCG GCGATGATGT ACATGGGTAG CTGGTACATG 
GGGACTTTAT TAACAAACAT TGATGATGGC AAAACAAATG TCGAATGGGG GATTGCCGAA 
ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT CACCGACAAG TTTTGCAATT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT CTTATAAAAC AGATGAAATT 
GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG AGTCTCACAA AAAGCCTTTA 
ACCCAGATAC AATTAATTTA G 



EF002-2 (SEQ ID NO: 6) 

MKFW KKGLTAAALL AVAAVTLTAC GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 
EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 
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KIYFARKGMP SDESHKKPLT QIQLI 

EF002-3 (SEQ ID NO:7) 

A TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 

GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA CTTGGAATTA TGACACGACC 
CC AG AATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA ATCCTGATAT CACTATTGAA 
CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA CAACGATGCT TTCATCAGGA 
GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG ATATCGAACC TGCCAAAGCA 
AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC AGCCTTACCG TACAGATTTC 
TGGGTATTGT ATTACAATAA AAAAATGTTT GATGAAGCCG GAATTGCCTA TCCCGATAAC 
TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT CTAAACCAGA AGAACAAGTA 
TATGGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA CTTATTATGA TCGCGCATTG 
AGAATGCAAA AAGATCAATC ACAAATGGAT TTTGGAACAG CAAAATCAAC AAAAGTAACG 
TATCAATCAC AATTTG AAAA TTCAAAAGCG GCGATGATGT ACATGGGTAG CTGGTACATG 
GGGACTTTAT TAACAAACAT TGATGATGGC AAAACAAATG TCGAATGGGG GATTGCCGAA 
ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT CACCGACAAG TTTTGCAATT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT CTTATAAAAC AGATGAAATT 
GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG AGTCTCACAA AAAGCCTTTA 
ACCCAGATAC AATTAATT 

EF002-4 (SEQ ID NO:8) 

C GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 

EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN 
QLVDIjTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 
KIYFARKGMP SDESHKKPLT QIQLI 



EF003-1 (SEQ ID NO:9) 

TAGGAGGACA AAAGAATGAA GAAGTTTTAT TTAGCNACAT TCGCTGTTAT TGCAACAGTT 
ATTTTAGCTG CCTGTGGGGG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC 
GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 
AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA ATGTTGCCTA CAACGATGCC 
GTGCAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC AACCCTTCAT GGAAATGTTT 
AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA TTTATTATTT TGCTGGTGGT 
TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG AAAATGCCAA AGTGGGGATT 
CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT TAAATGCAAA CGGCGTGATT 
AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 
AACATCACTT TTGAAAGCAT TGATTTACTG AATTTAGCTA AAGCCTATGA TGAAAAAGAC 
ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG CTGGTTTAAC AACGAAAGAT 
GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG CATTGCAAGT TGTGACACGC 
AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA TTCCTGCGTT TTAA 



EF003-2 (SEQ ID NO:10) 
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MKKFYL ATFAVIATVI LAACGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEK 
KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADWENPKN 
ITFESIDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQWTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF 

EF003-3 (SEQ ID NO: 11) 

CTGTGGGGG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC 
GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 
AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA ATGTTGCCTA CAACGATGCC 
GTGCAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC AACCCTTCAT GGAAATGTTT 
AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA TTTATTATTT TGCTGGTGGT 
TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG AAAATGCCAA AGTGGGGATT 
CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT TAAATGCAAA CGGCGTGATT 
AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 
AACATCACTT TTGAAAGCAT TGATTTACTG AATTTAGCTA AAGCCTATGA TGAAAAAGAC 
ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG CTGGTTTAAC AACGAAAGAT 
GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG CATTGCAAGT TGTGACACGC 
AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA TTCCTGCGTT T 



EF003-4 (SEQ ID NO: 12) 

CGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEK 

KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADWENPKN 
ITFESIDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQWTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF 



EF004-1 (SEQ ID NO:13) 

TAAATCGAAA GAAGGATGAT AGAAATGAAA AAAATGATTA AATTTGCAGG CATTGCTCTT 
ATTTTTGCAG CTCTTCTCTC TGCCTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 
GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC TGACGAGAAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AATAA 

EF004-2 (SEQ ID NO: 14) 

MKK MIKFAGIALI FAALLSACSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 
ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 



EF004-3 (SEQ ID NO: 15) 
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CTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 

GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC TG AC GAG AAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AA 



EF004-4 (SEQ ID NO: 16) 

CSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 

ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 



EF005-1 (SEQ ID NO: 17) 

TAAAAAATGA AAAAACGATT GACGATTGTG GGGATGCTTT TTCTGGCCAT TTTAGTAATG 
GTTGGTTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
GAATCCTTTG CCAATAGTGT AGCTAAACTG GATCAACAGC GCGAGGAAAG CAAGAATAAC 
TGGCCTGCAG AAGACTATGC TACAATTACT AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GTTGCCACAG AATCAGCCAA TTCTGGGAAT GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
AGTATTTCAG CGTTGTTAGC AACTTTATTT GATGATTTTA AAGTCCCAGA AGGCGGTTTG 
AAGAATGCTA GTGTCACAAC AATTCATTAC AAAAATGGCG AATATACTTT GGATAAAGTC 
AATGATGTCA GCTACTTAGA AGCAGGCGAA AAAGAATCAA AATAA 

EF005-2 (SEQ ID NO:18) 

MKKRLTIVG MLFLAILVMV GCGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 
WSDAVLTPEG EKWTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEWRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLWSHGLS ISALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 

EF005-3 (SEQ ID NO: 19) 

TTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
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GAATCCTTTG CCAATAGTGT AGCTAAACTG GATCAACAGC GCGAGGAAAG CAAGAATAAC 
TGGCCTGCAG AAGACTATGC TACAATTACT AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GTTGCCACAG AATCAGCCAA TTCTGGGAAT GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
AGTATTTCAG CGTTGTTAGC AACTTTATTT GATGATTTTA AAGTCCCAGA AGGCGGTTTG 
AAGAATGCTA GTGTCACAAC AATTCATTAC AAAAATGGCG AATATACTTT GGATAAAGTC 
AATGATGTCA GCTACTTAGA AGCAGGCGAA AAAGAATCAA AA 

EF005-4 (SEQ ID NO: 20) 

CGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 

WSDAVLTPEG EKWTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEWRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLWSHGLS ISALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 



EF006-1 (SEQ ID NO:21) 

TAAACGATAA ATGGAGGGAA TAAGATGAAA AAACGTACAT TATGGTCAGT AATTACTGTA 
GCAGTAGCTG TCTTAGTTTT AGGGGCTTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 
AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT CTTTAATGAA 
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT 
CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG 
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 
TAA 

EF006-2 (SEQ ID NO:22) 

MKK RTLWSVITVA VAVLVLGACG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 
KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE . 

EF006-3 (SEQ ID NO:23) 

TTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 

AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT CTTTAATGAA 
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT 
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CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG 
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 



EF006-4 (SEQ ID NO:24) 

CG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 

KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 

EF008-1 (SEQ ID NO:25) 

TAAACCGTGA GAAAGAAATG GAGGAATCAA CGAATGAAAA AATTTAGTTT ATTTTTTTTA 
ACACTTTTAG CAGGGTTAAC GTTAGCTGCT TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 
AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTGAAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG 
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN 
ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA 
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG TCTCAAAAGA AGTGAAACGA 
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCGATACG 
TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAATAA 



EF008-2 (SEQ ID NO:26) 

MKKFSLFFLT LLAGLTLAAC GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD 
KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN 
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN 
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT 
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL AKEGTEGDTY 
YSMMNWNLTK IHDGLMSK 

EF008-3 (SEQ ID NO: 27) 

T TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 

AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTGAAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG 
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
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TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN 
ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA 
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG TCTCAAAAGA AGTGAAACGA 
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCGATACG 
TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAA 

EF008-4 (SEQ ID NO:28) 

C GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD 

KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN 
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN 
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT 
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL AKEGTEGDTY 
YSMMNWNLTK IHDGLMSK 



EF009-1 (SEQ ID NO:29) 

TGACAAATGA AAAAATTTAG TAAATTAATT GG AC TTATTG GGGTATTAGC TTTTACGATT 
GCAGGTTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATT AC TG AAA ATAAACGCCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA 
GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAATAA 

EF009-2 {SEQ ID NO:30) 

MKKFSKLIG LIGVLAFTIA GCASGSVKDT KTETVKLGW GTKNDEWESV KDRLKKKNID 
LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV NAPLGIYANK 
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT 
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IWAREEDQE 
NKLYQKWEE YQQEETKKVI AETSKGANVP AWETFGKK 

EF009-3 (SEQ ID NO:31) 

TTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA 
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GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAA 



2F009-4 (SEQ ID NO:32) 

CASGSVKDT KTETVKLGW GTKNDEWESV KDRLKKKNID 

LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVS IGNTV NAPLGIYANK 
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT 
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IWAREEDQE 
NKLYQKWEE YQQEETKKVI AETSKGANVP AWETFGKK 



EF010-1 (SEQ ID NO: 33) 

TGAAAGAATA AAATTGTACA GGAGGAAATA AGGAATGAAA AAATGGCAAA AAGGATTAGC 
CGTAGCTGGC GCACAGCTTT AGCTGTAGGA CTAAGCGCGT GCGGTAAATC TTCAAAAGAT 
GCAGCGTCAA AAGGTGATGA TAGTACACCA ACGTTATTAA TGTATCGTGT TGGGGACAAA 
CCAGATAATT ATGACCAATT AATCGATAAT GCGAATAAAA TTATCGAGAA AAAAATTGGG 
GCAAAATTAA AAATGGAATT TGTTGGTTGG GGCGATTGGG ACCAAAAAAT GTCAACAATC 
GTTGCTTCTG GTGAAAGCTA TGATATTTCA TTAGCACAAA ATTATGCAAC GAATGCACAA 
AAAGGCGCCT ATGCTGATTT AACTGATTTA GCACCTAAAT ATGCCAAAGA AGCCTATGAT 
CAATTGCCAG ATAACTATAT TAAAGGAAAT ACGATTAATG GAAAACTGTA TGCGTTCCCA 
ATTTTAGGTA ACTCTTACGG TCAACAAGTT TTAACTTTTA ATAAAGAATA TGTCGATAAA 
TACAATTTAG ATATTAGTAA AGTCGATGGT AGTTATGAAA GTGCAACGGA AGTTCTAAAA 
GAATTCCNTA AAAANGANCC AAATATTGCT GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA 
ACAGGTAATT ATGACTTCCC TATTGGTAAC CAATATCCAT TTGCAGTAAA AACAACTGAT 
ACTGGCTCAC CAAAAATTAT TAACCAATAT GCCGACAAAG ACATGATTAA TAACTTAAAA 
GTCTTGCATC AATGGTATAA AGATGGCTTG ATTCCAACAG ATGCTGCTAC AAGTACAACA 
CCATATGACT TAAATACCAA TACTTGGTTT ATGCGTCAAG AAACACAAGG ACCTATGGAT 
TATGGTGATA CAATCTTAAC ACAAGCTGCT GGCAAACCAC TTGTTTCTCG TCCACTAACA 
GAACCATTAA AAACAACAGC TCAAGCGCAA ATGGCTAACT ATGTTGTTGC AAACACGTCT 
AAAAACAAAG AAAAATCTGT TGAATTGTTA GGTTTATTAA ACAGCAATCC AGAATTGTTA 
AACGGACTTG TTTATGGTGA AGAAGGCAAA CAATATGAAA AAGTTGGCGA TGATCGTGTG 
AAATTGTTGA AAGATTACAC ACCAACAACT CATTTGAGTG CTTGGAACAC AGGAAACAAC 
TTAATCATTT GGCCAGAAGA ATCTGTCACT GAAGAAATGG TTAAAGAACG TGATAAGAGC 
ATCGAAGAAG CAAAAGATTC ACCAATTCTT GGTTTTACTT TTGTAAATGA TAAAGTGAAA 
ACTGAAATCA CTAACGTTGC TACAGTTATG AACCGTTACG CAGCAAGCTT AAATACAGGA 
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA TTAATGGATG ACCTAAAAAC AGCTGGCTGG 
GATAAAGTTC AAAAAGAAAT GCAAACACAA TTAGACGAAT ATATCCAATC TCAAAAATAA 

EF010-2 (SEQ ID NO:34) 

MAKRISR SWRTALAVGL SACGKSSKDA ASKGDDSTPT LLMYRVGDKP 
DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG DWDQKMSTIV ASGESYDISL AQNYATNAQK 
GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 
NLDISKVDGS YESATEVLKE FXKXXPNIAA FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 
GSPKIINQYA DKDMINNLKV LHQWYKDGLI PTDAATSTTP YDLNTNTWFM RQETQGPMDY 
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM ANYWANTSK NKEKSVELLG LLNSNPELLN 
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH LSAWNTGNNL IIWPEESVTE EMVKERDKS I 
EEAKDSPILG FTFVNDKVKT EITNVATVMN RYAASLNTGT VDPEETLPKL MDDLKTAGWD 
KVQKEMQTQL DEYIQSQK 



EF010-3 (SEQ ID NO:35) 
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GT GCGGTAAATC TTCAAAAGAT 
GCAGCGTCAA AAGGTGATGA TAGTACACCA 
CCAGATAATT ATGACCAATT AATCGATAAT 
GCAAAATTAA AAATGGAATT TGTTGGTTGG 
GTTGCTTCTG GTGAAAGCTA TGATATTTCA 
AAAGGCGCCT ATGCTGATTT AACTGATTTA 
CAATTGCCAG ATAACTATAT TAAAGGAAAT 
ATTTTAGGTA ACTCTTACGG TCAACAAGTT 
TACAATTTAG ATATTAGTAA AGTCGATGGT 
GAATTCCNTA AAAANGANCC AAATATTGCT 
ACAGGTAATT ATGACTTCCC TATTGGTAAC 
ACTGGCTCAC CAAAAATTAT TAACCAATAT 
GTCTTGCATC AATGGTATAA AGATGGCTTG 
CCATATGACT TAAATACCAA TACTTGGTTT 
TATGGTGATA CAATCTTAAC ACAAGCTGCT 
GAACCATTAA AAACAACAGC TCAAGCGCAA 
AAAAACAAAG AAAAATCTGT TGAATTGTTA 
AACGGACTTG TTTATGGTGA AGAAGGCAAA 
AAATTGTTGA AAGATTACAC ACCAACAACT 
TTAATCATTT GGCCAGAAGA ATCTGTCACT 
ATCGAAGAAG CAAAAGATTC ACCAATTCTT 
ACTGAAATCA CTAACGTTGC TACAGTTATG 
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA 
GATAAAGTTC AAAAAGAAAT GCAAACACAA 



ACGTTATTAA TGTATCGTGT TGGGGACAAA 
GCGAATAAAA TTATCGAGAA AAAAATTGGG 
GGCGATTGGG ACCAAAAAAT GTCAACAATC 
TTAGCACAAA ATTATGCAAC GAATGCACAA 
GCACCTAAAT ATGCCAAAGA AGCCTATGAT 
ACGATTAATG GAAAACTGTA TGCGTTCCCA 
TTAACTTTTA ATAAAGAATA TGTCGATAAA 
AGTTATGAAA GTGCAACGGA AGTTCTAAAA 
GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA 
CAATATCCAT TTGCAGTAAA AACAACTGAT 
GCCGACAAAG ACATGATTAA TAACTTAAAA 
ATTCCAACAG ATGCTGCTAC AAGTACAACA 
ATGCGTCAAG AAACACAAGG ACCTATGGAT 
GGCAAACCAC TTGTTTCTCG TCCACTAACA 
ATGGCTAACT ATGTTGTTGC AAACACGTCT 
GGTTTATTAA ACAGCAATCC AGAATTGTTA 
CAATATGAAA AAGTTGGCGA TGATCGTGTG 
CATTTGAGTG CTTGGAACAC AGGAAACAAC 
GAAGAAATGG TTAAAGAACG TGATAAGAGC 
GGTTTTACTT TTGTAAATGA TAAAGTGAAA 
AACCGTTACG CAGCAAGCTT AAATACAGGA 
TTAATGGATG ACCTAAAAAC AGCTGGCTGG 
TTAGACGAAT ATATCCAATC TCAAAAA 



EF010-4 (SEQ ID NO:36) 

CGKSSKt)A ASKGDDSTPT LLMYRVGDKP 
DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG 
GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT 
NLDISKVDGS YESATEVLKE FXKXXPNIAA 
GSPKIINQYA DKDMINNLKV LHQWYKDGLI 
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM 
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH 
EEAKDSPILG FTFVNDKVKT EITNVATVMN 
KVQKEMQTQL DEYIQSQK 



DWDQKMSTIV ASGESYDISL AQNYATNAQK 
INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 
FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 
PTDAATSTTP YDLNTNTWFM RQETQGPMDY 
ANYWANTSK NKEKSVELLG LLNSNPELLN 
LSAWNTGNNL IIWPEESVTE EMVKERDKSI 
RYAASLNTGT VDPEETLPKL MDDLKTAGWD 



EF011-1 {SEQ ID NOT37) 

TAACGTTTTT GGAGGAAAAG AATGAAAAAG AAATTTTTAG CAATGATGGC AGTTTCAATG 
ATGGGACTGT TAATGTTAAG TGCTTGTCAA ACAAATAAAA AAACAGCAGA TTCTGCAACA 
ACAGAAACAA CAGCTAAAAC GGAAGTCACA GTCAAAGACA CCAATGGTCA ATTAACCGTT 
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT GATAATGGTT CCTTGGATAC AATGGATGCA 
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG CCAACTAAAA ATATCCCTGC GTATTTGAAA 
AAATACCAAA AAGTTGAATC AGCAGGCGGC ATTAAAGAAC CAGATTTAGA AAAAATCAAT 
CAACTAAAAC CAGACTTAAT TATTATTTCT GGTCGTCAAC AAGATTATCA AGAACAATTA 
AAAGCCATTG CGCCAACCAT TTACTTAGCT GTAGATGCCA AAAATCCTTG GGCATCAACG 
AAACAAAATA TCGAAACGTT AGGCACTATT TTTGATAAAG AAGAGGTAGC TAAAGAAAAA 
ATAACTGGCT TAGAAAAAGA AATTGCTGAC GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT 
AATGCGCTTG TTGTGTTAGT TAACGAAGGA CAACTTTCCG CTTACGGAAA AGGCTCTCGT 
TTCGGTTTAA TTCATGATAC ATTTGGCTTC AAAGCAGCAG ACGATAAGAT TGAAGCTTCC 
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ACTCATGGGC AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT 
GTGGTAGATC GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC 
GAATTGATTC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT 
GTTTGGTATC TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA 
AAAGGATTAG AGTAA 

EF011-2 (SEQ ID NO:38) 

MKKK FLAMMAVSMM GLLMLSACQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP 
KNPKKVWFD NGSLDTMDAL GVGDRWGAP TKNIPAYLKK YQKVESAGGI. KEPDLEKINQ 
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI 
TGLEKEIADV KKQAEASANN ALWLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST 
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV 
WYLSGGGLES MHLMIEDVKK GLE 



EF011-3 (SEQ ID NO:39) 



TTGTCAA ACAAATAAAA AAACAGCAGA TT( 
ACAGAAACAA CAGCTAAAAC GGAAGTCACA 
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT 
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG 
AAATACCAAA AAGTTGAATC AGCAGGCGGC 
CAACTAAAAC CAGACTTAAT TATTATTTCT 
AAAGCCATTG CGCCAACCAT TTACTTAGCT 
AAACAAAATA TCGAAACGTT AGGCACTATT 
ATAACTGGCT TAGAAAAAGA AATTGCTGAC 
AATGCGCTTG TTGTGTTAGT TAACGAAGGA 
TTCGGTTTAA TTCATGATAC ATTTGGCTTC 
ACTCATGGGC AAAGTGTTTC TTACGAATAT 
GTGGTAGATC GCACCAAAGC AATTGGTGGC 
GAATTGATTC AAAAAACCGA TGCTGGTAAA 
GTTTGGTATC TAAGCGGTGG TGGATTAGAA 
AAAGGATTAG AG 



GTCAAAGACA CCAATGGTCA ATTAACCGTT 
GATAATGGTT CCTTGGATAC AATGGATGCA 
CCAACTAAAA ATATCCCTGC GTATTTGAAA 
ATTAAAGAAC CAGATTTAGA AAAAATCAAT 
GGTCGTCAAC AAGATTATCA AGAACAATTA 
GTAGATGCCA AAAATCCTTG GGCATCAACG 
TTTGATAAAG AAGAGGTAGC TAAAGAAAAA 
GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT 
CAACTTTCCG CTTACGGAAA AGGCTCTCGT 
AAAGCAGCAG ACGATAAGAT TGAAGCTTCC 
GTTTTAGAAA AAAATCCTGG GATTCTCTTT 
GACGATTCAA AAGATAACGT CGCTGCAAAC 
AATGATAAAG TCATTATGCT TCAACCAGAT 
TCAATGCATT TGATGATAGA AGATGTTAAA 



EF011-4 (SEQ ID NO:40) 

CQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP 

KNPKKVWFD NGSLDTMDAL GVGDRWGAP TKNIPAYLKK YQKVESAGGI KEPDLEKINQ 
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI 
TGLEKEIADV KKQAEASANN ALWLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST 
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV 
WYLSGGGLES MHLMIEDVKK GLE 



EF012-1 (SEQ ID NO:41) 

TGAGGGGGCA ACAACATGAA ATTGGGGAAA 
CTTTTAGCCG CATGTGGCGG AACCAAAGAA 
GCAGCTGAAC AAAAAATCAG TATTAGTTCA 
CAAACAACAG ATAAAAATAC CTTTACAATG 
TTTGATGATG ATAGTGCCAC GGTGCCAGCT 



AAAGTAGTAG GTTTGATTGC AACAGGGTTT 
GCGGCAGAGA AAGTAGATTC GGGAAATTTA 
CCTGCACCAA TCTCAACATT GGATACAACA 
GCACAACATT TATTTGAAGG CCTTTATCGG 
CTAGCTAAAG ATGTCAAGAT TAGTGACGAT 
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GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC 
ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG 
AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA 
GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG 
GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA 
TTTGTCGAAG CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG 
CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA 
GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA 
GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA 
CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC 
TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA 
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC 
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAATGA 

EF012-2 (SEQ ID NO:42) 

MKLGKK WGLIATGFL LAACGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ 
TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT 
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA 
QPSFLAWSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE 
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY 
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL 
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF012-3 (SEQ ID NO: 43) 

ATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA 
GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA 
CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG 
TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT 
GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC 
ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG 
AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA 
GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG 
GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA 
TTTGTCGAAG CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG 
CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA 
GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA 
GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA 
CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC 
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TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTAGGAAAA 
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC 
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAA .. 



EF012-4 (SEQ ID NO:44) 

CGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ 

TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT 
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA 
QPSFLAWSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE 
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY 
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL 
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY <3NYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF013-1 (SEQ ID NO:45) 

TAACGAAAAA TGAAAAAAAT TGCTTTGTTC 
AGTCTAGCAG GATGTGGAAA CAAAAAAACA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT 
TCTGTCACGG GTCAAAATTC TAATGTGACA 
CCTTCTGCAG GAACTGATGA AACGAATGAA 
ATTACAATTA CCAATGTTGT TTTCAATCCT 
CCTAATGCAA CCATTACAGC AACGGTAGTC 
TATGCGGATG CCAATGGCAA TTTTACAGTA 
TTAATCGCAA CCGTTGATCA ACGGAATAGT 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA 
GATTTACCAA ATTAA 



AGTATGTTAA CGTTCAGTGT ATTGTCTTTA 
GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
AGCCAACAAG AAATGAAAAG TAGTCATTCG 
GGGGAAAATC CGTCAGAAAA TGCCACGCAG 
GTCCCTCAAA ACCAAGCACC TGATACAAAC 
GAAAGAAATG AAATTAATGG TACTACATTA 
GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
GCACCTGTCC AAATTGATAT TCCAAGTTCA 
ATTACGATTG ATCCGAAACA AGGGACAATT 
TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
GGG AATTTCA CAGCCAGTAA TTTAGTTCCC 
AATGGAGAAA TAGGGACACC TTACTTGTTT 



EF013-2 (SEQ ID NO:46) 

MKKIALFS MLTFSVLSLS LAGCGNKKTA STNDSKPKQE TKKATQKS SS QQEMKSSHSS 
VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWFNPE RNEINGTTLP 
NATITATWG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG 
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG 
TKNRLDVTLN GEIGTPYLFD LPN 
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EF013-3 (SEQ ID NO: 47) 

ATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG 
TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG 
CCTTCTGCAG GAACTGATGA AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC 
ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA 
CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT 
GATTTACCAA AT 

EF013-4 (SEQ ID NO:48) 

CGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS 

VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWFNPE RNEINGTTLP 
NATITATWG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG 
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG 
TKNRLDVTLN GEIGTPYLFD LPN 

EF014-1 (SEQ ID NO:49) 

TGATGGTGGA GACTTTTTAA GAGAGAGGAA GTACAGCCAA TGAGTAGGAA GCGAAAAATC 
AGCTTAATTA GTTTAGTCAT CATTTTGGTT TTTGTCACAG TCGGCTCAGC ATACTTTGCT 
GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 
ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC 
AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 
AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT 
TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 
TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 
CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT 
ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 
AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA 
AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC 
GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA 
ACGGAAAATG GCTTTATAAA ATAA 

EF014-2 (SEQ ID NO: 50) 

MSRKRKIS LISLVIILVF VTVGSAYFAV AGSYLKKTID KGYVPIKNDY 
NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYWFNF LSFIKLIDAV GGIDVNVKQA 
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL 
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YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF014-3 (SEQ ID NO:51) 

TGCT 

GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 
ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC 
AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 
AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT 
TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 
TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 
CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT 
ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 
AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA 
AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC 
GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA 
ACGGAAAATG GCTTTATAAA A 



EF014-4 (SEQ ID NO:52) 
AV AGSYLKKTID KGYVPIKNDY 

NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYWFNF LSFIKLIDAV GG I DVNVKQA 
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL 
YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF015-1 (SEQ ID NO: 53) 

TAATTAAAAA TGTGTAAAAA GGGTCTGATG AAAAAAGGAG ACATAATAGT TATTATCTTT 
TTAATAGCTA TCTCTTTTTC TCCATATTTT ATTTTTTTTC ACAATAATCC ATTTAACTCC 
AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT 
ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATC CATC AAA AGGGCAATAT 
AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 
ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 
AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA CTAA 

EF015-2 (SEQ ID NO:54) 

MK KGDIIVIIFL. IAISFSPYFI FFHNNPFNSK SFDDTKYAW KIDGKEIERI 
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR 
FILEIVQQYS KDYYIY 

EF015-3 (SEQ ID NO:55) 
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CAATAATCC ATTTAACTCC 

AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT 

ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATCCATCAAA AGGGCAATAT 

AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 

ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 

AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA C 

EF015-4 (SEQ ID NO:56) 

NNPFNSK SFDDTKYAW KIDGKEIERI 

NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR 
FILEIVQQYS KDYYIY 



EF016-1 (SEQ ID- NO: 57) 

TGACGGTTGC CCCCGTCCAA TAGAAAGGAG TTTATGATGA AAAAGAAATA TTCTTTAGCC 
TTGCTGGTTA TCTGTTGTAG TTTACTCCTA TTTGCAGGTT GTGGTAAAAG AAAAAGCAAC 
GAAGATCAAT GGACACGGAT TAACGAAGAA AAACGGATTA TTATTGGCTT AGATGACTCC 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATGAAAGAAA CAGAATTACA AAATCAAACC ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC ACACAACCTT ACATGACGAA CGACCAAGTA 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
AAATTTGTTA AAGACCAAAC ACCTATTTTA TATGACGGCT TTAATGAAGC TTTCTTAGAT 
TTAAAATCTG GTCGAATTGA CGGACTCCTA ATCGATCGCG TTTACGCCAA CTACTATCTT 
TCCCACGAAG ATAATTTAAA AAACTATACT ATTTCTCATG TAGGCTATGA CAATGAAGAT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT CAATTAGTCC AAAAAATCAA TACTGCCTTT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT AAAATTTCTC AAAAATGGTT TGGAGAGGAC 
GTTACAAATA ACACAAAAAT AAACTAA 



EF016-2 (SEQ ID NO: 58) 

MMKKKYSLAL LVICCSLLLF AGCGKRKSNE DQWTRINEEK RIIIGLDDSF 
VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK ISQKWFGEDV TNNTKIN 



EF016-3 (SEQ ID NO: 59) 
AAGCAAC 

GAAGATCAAT GGACACGGAT TAACGAAGAA AAACGGATTA TTATTGGCTT AGATGACTCC 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATGAAAGAAA CAGAATTACA AAATCAAACC ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC ACACAACCTT ACATGACGAA CGACCAAGTA 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
AAATTTGTTA AAGACCAAAC ACCTATTTTA TATGACGGCT TTAATGAAGC TTTCTTAGAT 
TTAAAATCTG GTCGAATTGA CGGACTCCTA ATCGATCGCG TTTACGCCAA CTACTATCTT 
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TCCCACGAAG ATAATTTAAA AAACTATACT ATTTCTCATG TAGGCTATGA CAATGAAGAT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT CAATTAGTCC AAAAAATCAA TACTGCCTTT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT AAAATTTCTC AAAAATGGTT TGGAGAGGAC 
GTTACAAATA ACACAAAAAT AAAC 

EF016-4 (SEQ ID NO: 60) 

SNE DQWTRINEEK RIIIGLDDSF 

VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS VDFQPIDWSM KETELQNQTI 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT ASDMQGKILG VQNGSSGYDG 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI DRVYANYYLS HEDNLKNYTI 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK ISQKWFGEDV TNNTKIN 



EF017-1 (SEQ ID NO: 61) 

TGAGGTGTTT TTATGAAAAG GGCAACAAAG CAAAGGCTGT CTTTGGCAGC AATCATGGTT 
CTACTTCTCT CGGGCTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 
TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 
ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 
ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT 
TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 
TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGGGC CTTCACATTA 
GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 
CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 
CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 
TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 
TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 
AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 
GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GCGACAAGTA A 



EF017-2 (SEQ ID NO: 62) 

MKRATKQ RLSLAAIMVL LLSGCGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 
AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QWKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL 
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 



DLIWNGYTKT 
FESQPDVLKK 
SHVGYDNEDF 
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LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS 

PVPFNNRLEK SRSGDFDIW GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA 

NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 

EF017-3 (SEQ ID NO: 63) 

CTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 

TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 

ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 

GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 

ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT 

TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 

TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 

GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 

TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 

ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 

GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 

CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 

AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 

CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 

TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 

TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 

TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 

GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 

TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 

ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 

TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 

GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 

AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 

GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 

CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 

GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GCGACAAG 



EF017-4 (SEQ ID NO: 64) 

CGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 

AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QWKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL 
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 
LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS 
PVPFNNRLEK SRSGDFDIW GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA 
NVTYANKYEE RWKTLQKADQ . LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 



EF018-1 (SEQ ID NO:65) 

TGTCATTACA ACGATACCAA TTTTAATCAT TTATCCATTA CTACAAAAAC ACTTTATCGG 
CGGTATGATG GCCGGTGCAG TAAAAGAATA AAGAAAGTAG GGAACAATAT GAAAAAAGTT 
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TTAGGCGGTT TATTGGTGGC AACGGCGGTC GTTAGTTTAG CGGCCTGTAG CGGTGGGGAA 
AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGAOGCCT 
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT 
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AG AC TTAAAT 
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA 
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT 
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT 
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TCGTTACCCT 
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT 
GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC 
ACCACAGTGG GCGATGGTGC AGGACCATTT TTAGACAAAT TACAAGACTT CTTAGGTGTT 
CCTTTAGAGG ATAAAAATGG TAAATACTAT GATCGAAATT TAGATAAAGA ATATTTAGAA 
TGGTTAAAAA CATTTAATGA TGTTTACCGA GCAGGCAATA TTAGTGATGA TAGCTTCACA 
GATGATGGGG CAACGTTTGA TGAAAAAGTG AAACAAGGAA ATTATGCAAC CATGCTCGTT . 
GCTGGAACCA GTGGTCAAGG TGGGAACTTC ACAGAATTTA TGAAAAAATC TGGCACACGT 
TATATAGCCA TTGATGGACC AAGTAGCACT TCTGGCCGAA AACCAACATT AAATCAAACC 
GGCATTTCAG GTTGGTTAAG TAATTACATT ACGAAAGATG CGAAAGATCC AGCAAAAGTC 
ACTCAACTGT TCACATATTT AATTGATGAA CCGGGACAAA TTTTAACAAA ATATGGCGTT 
GAAGGAGTTA CTTATGCGTA CAATGATCAA GGAAAAATTG ATTATTTACC AGAAGTGAAA 
AAATTAGAAC AAACAGACAA TGATGCCTAC AACAAAAAAT ATGGCATTAG TCGTTTCCTA 
TACTTTAACA ACGACCGTGT CAATAAACTA AAAGTACCAA TGGAAAGTGC TTTAACGCAA 
ATGCAAGAAT GGGGCAAAGG AAAATTAGTC CCACATTTCG TAATTGAAAA TATTAATCCA 
GATGCAGGAA CGCCGGAAGC TCGTGCGAAT GAAGCGATTG AAACCAAACT AAATACAACC 
GTTATTTCAA TGATTCGTGC GAAAGATGAT AAAGCCTTTG ACAAATCTTT AGAAGACTAC 
AAAGCATTCT TAAAATCAAA TAAATGGGAT GCAATTGAAA AAATAAAATC TGAGAAAATG 
GCGGAAAACA GAGACAAACT TAAGTAA 



EF018-2 (SEQ ID NO: 66) 



MKKV LGGLLVATAV VSLAACSGGE 
KKASSDVSIK DRYELDEKTP AWKLDKKKEP 
VDIEFISGDD SKLNAMISSG DMPDIVTLTE 
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE 
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF 
WLKTFNDVYR AGNISDDSFT DDGATFDEKV 
YIAIDGPSST SGRKPTLNQT GISGWLSNYI 
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY 
MQEWGKGKLV PHFVIENINP DAGTPEARAN 
KAFLKSNKWD AIEKIKSEKM AENRDKLK . 



TKIKWYINSD WTALPFGKDV TTAQIKKDLN 
KTGQAALKAD SWAYS LNDLA KKYDPYLMKV 
SGNIPVNDNF VIREDVYNAL GKPDVSTPEN 
LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE 
KQGNYATMLV AGTSGQGGNF TEFMKKSGTR 
TKDAKDPAKV TQLFTYLIDE PGQILTKYGV 
NKKYGISRFL YFNNDRVNKL KVPMESALTQ 
EAIETKLNTT VISMIRAKDD KAFDKSLEDY 



EF018-3 (SEQ ID NO : 67 ) 



CTGTAG CGGTGGGGAA 
AAGAAAGCTA GCTCAGATGT 
GCTTGGAAGT TAGATAAGAA 
TGGACGGCGC TGCCTTTTGG 
GTGGATATTG AATTTATTTC 
GATATGCCTG ATATCGTGAC 
TCTTGGGCCT ATTCTTTAAA 
GTTAACCAAG ATACGTTTAA 
AATTACTCTA ATACAAAAGC 
GTTATTCGTG AAGATGTCTA 
TTTGAAAAAG TCATGCAACA 



CTCAATTAAG GATCGGTATG 
GAAAGAACCG ACCAAGATTA 
AAAAGACGTG ACCACTGCGC 
CGGCGATGAT TCAAAATTAA 
ATTAACTGAA AAAACTGGAC 
CGATTTAGCT AAAAAATATG 
ATGGTATGCC TTAGAGGATG 
GGATTATGAA AGTGGAAATA 
TAATGCATTA GGCAAGCCAG 
GATTAAAGAA AAATATCCTG 



AATTAGATGA AAAGACGCCT 
AATGGTATAT TAACTCAGAT 
AGATTAAAAA AGACTTAAAT 
ATGCCATGAT TTCAAGTGGA 
AAGCAGCATT GAAAGCAGAT 
ACCCCTATTT AATGAAAGTT 
GAAAAACATA TGGTTACCCT 
TCCCAGTAAA TGATAATTTT 
ACGTTTCAAC ACCAGAAAAT 
AGATGACCCC AATGGGCTTC 
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ACCACAGTGG GCGATGGTGC 
CCTTTAGAGG ATAAAAATGG 
TGGTTAAAAA CATTTAATGA 
GATGATGGGG CAACGTTTGA 
GCTGGAACCA GTGGTCAAGG 
TATATAGCCA TTGATGGACC 
GGCATTTCAG GTTGGTTAAG 
ACTCAACTGT TCACATATTT 
GAAGGAGTTA CTTATGCGTA 
AAATTAGAAC AAACAGACAA 
TACTTTAACA ACGACCGTGT 
ATGCAAGAAT GGGGCAAAGG 
GATGCAGGAA CGCCGGAAGC 
GTTATTTCAA TGATTCGTGC 
AAAGCATTCT TAAAATCAAA 
GCGGAAAACA GAGACAAACT 



AGGACCATTT TTAGACAAAT 
TAAATACTAT GATCGAAATT 
TGTTTACCGA GCAGGCAATA 
TGAAAAAGTG AAACAAGGAA 
TGGGAACTTC ACAGAATTTA 
AAGTAGCACT TCTGGCCGAA 
TAATTACATT ACGAAAGATG 
AATTGATGAA CCGGGACAAA 
CAATGATCAA GGAAAAATTG 
TGATGCCTAC AACAAAAAAT 
CAATAAACTA AAAGTACCAA 
AAAATTAGTC CCACATTTCG 
TCGTGCGAAT GAAGCGATTG 
GAAAGATGAT AAAGCCTTTG 
TAAATGGGAT GCAATTGAAA 
TAAG 



TACAAGACTT CTTAGGTGTT 
TAGATAAAGA ATATTTAGAA 
TTAGTGATGA TAGCTTCACA 
ATTATGCAAC CATGCTCGTT 
TGAAAAAATC TGGCACACGT 
AACCAACATT AAATCAAACC 
CGAAAGATCC AGCAAAAGTC 
TTTTAACAAA ATATGGCGTT 
ATTATTTACC AGAAGTGAAA 
ATGGCATTAG TCGTTTCCTA 
TGGAAAGTGC TTTAACGCAA 
TAATTGAAAA TATTAATCCA 
AAACCAAACT AAATACAACC 
ACAAATCTTT AGAAGACTAC 
AAATAAAATC TGAGAAAATG 



EF018-4 (SEQ ID NO:68) 



CSGGE 

KKASSDVSIK DRYELDEKTP AWKLDKKKEP 
VDIEFISGDD SKLNAMISSG DMPDIVTVTE 
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE 
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF 
WLKTFNDVYR AGNISDDSFT DDGATFDEKV 
YIAIDGPSST SGRKPTLNQT GISGWLSNYI 
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY 
MQEWGKGKLV PHFVIENINP DAGTPEARAN 
KAFLKSNKWD AIEKIKSEKM AENRDKLK 



TKIKWYINSD WTALPFGKDV TTAQIKKDLN 
KTGQAALKAD SWAYS LNDLA KKYDPYLMKV 
SGNIPVNDNF VIREDVYNAL GKPDVSTPEN 
LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE 
KQGNYATMLV AGTSGQGGNF TEFMKKSGTR 
TKDAKDPAKV TQLFTYLIDE PGQILTKYGV 
NKKYGISRFL YFNNDRVNKL KVPMESALTQ 
EAIETKLNTT VISMIRAKDD KAFDKSLEDY 



EF019-1 (SEQ ID NO:69) 

TAAAGGAGTT ACACAATGAA ACTTTTAAAA 
CTTGGTTCAT TCTTACTCGC AGCTTGTGGT 
AAAACACATG AAGTAACAGA TACCTTAGGC 
CGGATTATTG CGAGTTATTT AGAAGATTAT 
CAATGGACAG TTGGACAAGG CAGCATTCAA 
CCCACTATTT CCTATGACTT GCCATATGAA 
TTAATCAGTT CATCTGCTCT AGTTGAAGGC 
CCAACTTATG TAGTCAAAAA CGGCGAAAAT 
GCCACTGTTT TAGATAAAAA AGAACAAGCG 
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA 
TGGGTAACCA ACAACCAAGT CTTTATGGTT 
TATCAGGACT TAGGCCTCCA AGTTCCAAAA 
GCGGATTGGA ATCAAGTTTC TTTAGAAAAA 
CTTGTAAACA GCGATGAATC AGCACCTCTT 
GCTGTGAAAA ATAACCAAGT TCATACCTAT 
CCTATTGCGA ATACTCAAAT TGTTGAAGAT 

EF019-2 ( (SEQ ID NO:70) 



AAGACGGTCC TAATTGGTAC AACCCTTCTT 
AATACGAATA AAGAAGCCAA CAACGCTGAC 
AATAAAGTAA CCGTCCCCGC GAAACCCAAA 
CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA 
GATTATTTAG CGAAAGAATT GAAAGATGTC 
GCGGTTCTAA AATTTGAACC TGACTTATTA 
GGTAAATACA AAGAATACAG TAAAATTGCG 
GTCACCTGGC GTGATCAATT GGAAGATATT 
AAAAAAGTGT TAGAAGATTA TGATACCTTA 
AAAGATGCTG GCAAATCTGC GGCAGTCTTA 
AGCGATAATC GCTCAAGCGG AACCGTGCTC 
TTAGTGGAAG AAATTTCTAA AAACGCTACT 
TTAGCTGAGC TTGACGCAGA CCACATTTTC 
TTCCAAGAAG CAATTTGGAA GAACTTACCT 
GATAAAAAAA GTAGTTGGTT ATACAACGGA 
GTAAAAAAAG CGCTCTTAAA TTAA 



MKLLKK TVLIGTTLLL GSFLLAACGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 
IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL 
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ISSSALVEGG KYKEYSKIAP TYWKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT 
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA 
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP 
IANTQIVEDV KKALLN 

EF019-3 (SEQ ID NO:71) 

TTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC 

AAAACACATG AAGTAACAGA TACCTTAGGC AATAAAGTAA CCGTCCCCGC GAAACCCAAA 
CGGATTATTG CGAGTTATTT AGAAGATTAT CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA 
CAATGGACAG TTGGACAAGG CAGCATTCAA GATTATTTAG CGAAAGAATT GAAAGATGTC 
CCCACTATTT CCTATGACTT GCCATATGAA GCGGTTCTAA AATTTGAACC TGACTTATTA 
TTAATCAGTT CATCTGCTCT AGTTGAAGGC GGTAAATACA AAGAATACAG TAAAATTGCG 
CCAACTTATG TAGTCAAAAA CGGCGAAAAT GTCACCTGGC GTGATCAATT GGAAGATATT 
GCCACTGTTT TAGATAAAAA AGAACAAGCG AAAAAAGTGT TAGAAGATTA TGATACCTTA 
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA AAAGATGCTG GCAAATCTGC GGCAGTCTTA 
TGGGTAACCA ACAACCAAGT CTTTATGGTT AGCGATAATC GCTCAAGCGG AACCGTGCTC 
TATCAGGACT TAGGCCTCCA AGTTCCAAAA TTAGTGGAAG AAATTTCTAA AAACGCTACT 
GCGGATTGGA ATCAAGTTTC TTTAGAAAAA TTAGCTGAGC TTGACGCAGA CCACATTTTC 
CTTGTAAACA GCGATGAATC AGCACCTCTT TTCCAAGAAG CAATTTGGAA GAACTTACCT 
GCTGTGAAAA ATAACCAAGT TCATACCTAT GATAAAAAAA GTAGTTGGTT ATACAACGGA 
CCTATTGCGA ATACTCAAAT TGTTGAAGAT GTAAAAAAAG CGCTCTTAAA T 



EF019-4 (SEQ ID NO:72) 

CGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 
I I ASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP 
ISSSALVEGG KYKEYSKIAP TYWKNGENV TWRDQLEDIA 
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY 
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA 
IANTQIVEDV KKALLN 



EF020-1 (SEQ ID NO:73) 

TGAGGAGATG AGAAAATGAA AAAGGTAGTT TCAATTTTGT TGATGGTTGT TGCAGTCTTC 
ACATTAACTG CATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT 
ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA 
TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA 
ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA 
AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA 
GGGAAATTTA AATAA 

EF020-2. (SEQ ID NO:74) 

MKKWS ILLMWAVFT LTACNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY 
KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEWYKSG 
KFK 



EF020-3 (SEQ ID NO:75) 
ATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT 



TISYDLPYEA VLKFEPDLLL 
TVLDKKEQAK KVLEDYDTLT 
QDLGLQVPKL VEEISKNATA 
VKNNQVHTYD KKSSWLYNGP 
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ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA 

TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA 

ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA 

AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA 
GGGAAATTTA AA 

EF020-4 (SEQ ID NO:76) 



CNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY 

KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEWYKSG 
KFK 



EF021-1 (SEQ ID NO:77) 

TAGTTGTTTA AATACATTAA 
TTATTCGGTT TTAGTTTGAT 
GGCAAAGGCA AAACCGCTGA 
ATCATTACAG ATACAGGCGG 
TTGCAAGCTT GGGGTAAAGA 
CAATCGAATG ATGCAGCTGA 
AACACAATCT TTGGTATTGG 
AACCCTGATA CAAACTTTGT 
TCTGCAACAT TTAGAGATAA 
ACAAAAACGA ACAAAGTCGG 
CAAGCTGGTT TTGAAAAAGG 
GTTGATACGA AATATGCGGC 
GCAATGTACC AAAACGGCGT 
GTCTTCCAAG AAGCAAAAGA 
GGCGTTGACC GCGATCAAGA 
AACTTCACGT TAACTTCAAC 
CGTGCGTTAG AAGACAAATT 
GGCGTTGACT TAACAGACGG 
AAAGATAAAG TAATCTCAGG 



ACTATTTTTA GGAGGCTTTA 
TGCATTAGGT TTATCAGTTT 
AAGCGGCGGT GGCAAAGGGG 
CGTGGATGAC AAGTCGTTCA 
ACATGATTTA CCAGAAGGTT 
CTATACAACC AATATTGACC 
CTACTTGCTA AAAGATGCAA 
TTTAATCGAT GATCAAATCG 
TGAAGCAGCT TACTTAGCCG 
TTTTGTTGGT GGTGAAGAAG 
TGTGGCTGAT GCTGCGAAAG 
TTCATTTGCT GATCCTGCCA 
TGATATCATC TTCCATGCTT 
CTTGAATGAA TCAGGTTCTG 
TGCTGATGGC AAGTACAAAA 
GCTTAAAGGT GTCGGCACAG 
CCCTGGTGGC GAACATTTAG 
CTATTTAAAC GACAAAACAA 
TGACGTAAAA GTCCCAGAAA 



CAGAAATGAA AAAAGCAAAA 
CACTTGCAGC ATGTGGTGGT 
ATGCAGCGCA TAGTGCTGTA 
ACCAATCTTC TTGGGAAGGA 
CAAAAGGGTA TGCATATATT 
AAGCGGTATC AAGTAAATTC 
TTTCTTCTGC AGCAGATGCC 
ATGGCAAAAA GAATGTCGTT 
GTGTTGCTGC TGCAAATGAA 
GGGTCGTAAT TGACCGTTTC 
AATTAGGTAA AGAAATTACT 
AAGGGAAAGC TTTAGCTGCT 
CTGGTGCGAC TGGACAAGGG 
GCGACAAAGT TTGGGTAATC 
CAAAAGACGG CAAAGAAGAC 
CGGTTCAAGA TATTGCCAAC 
TTTATGGATT AAAAGATGGT 
AAGAAGCTGT TAAAACAGCA 
AACCAGAATA A 



EF021-2 (SEQ ID NO;78) 

MKKAKL FGFSLIALGL SVSLAACGGG KGKTAESGGG KGDAAHSAVI 

ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN 
TIFGIGYLLK DAISSAADAN PDTNFVLIDD QIDGKKNWS ATFRDNEAAY LAGVAAANET 
KTNKVGFVGG EEGWIDRFQ AGFEKGVADA AKELGKEITV DTKYAASFAD PAKGKALAAA 
MYQNGVDIIF HASGATGQGV FQEAKDLNES GSGDKVWVIG VDRDQDADGK YKTKDGKEDN 
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND KTKEAVKTAK 
DKVISGDVKV PEKPE 



EF021-3 (SEQ ID NO:79) 
ATGTGGTGGT 

GGCAAAGGCA AAACCGCTGA 
ATCATTACAG ATACAGGCGG 
TTGCAAGCTT GGGGTAAAGA 
CAATCGAATG ATGCAGCTGA 



AAGCGGCGGT GGCAAAGGGG 
CGTGGATGAC AAGTCGTTCA 
ACATGATTTA CCAGAAGGTT 
CTATACAACC AATATTGACC 



ATGCAGCGCA TAGTGCTGTA 
ACCAATCTTC TTGGGAAGGA 
CAAAAGGGTA TGCATATATT 
AAGCGGTATC AAGTAAATTC 
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AACACAATCT TTGGTATTGG CTACTTGCTA AAAGATGCAA TTTCTTCTGC AGCAGATGCC 
AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATGGCAAAAA GAATGTCGTT 
TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC TGCAAATGAA 
ACAAAAACGA ACAAAGTCGG TTTTGTTGGT GGTGAAGAAG GGGTCGTAAT TGACCGTTTC 
CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA AGAAATTACT 
GTTGATACGA AATATGCGGC TTCATTTGCT GATCCTGCCA AAGGGAAAGC TTTAGCTGCT 
GCAATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTGGTGCGAC TGGACAAGGG 
GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT TTGGGTAATC 
GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG CAAAGAAGAC 
AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA TATTGCCAAC 
CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT AAAAGATGGT 
GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT TAAAACAGCA 
AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAA 



EF021-4 (SEQ ID NO: 80) 

CGGG KGKTAESGGG KGDAAHSAVI 
ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP 
TIFGIGYLLK DAISSAADAN PDTNFVLIDD 
KTNKVGFVGG EEGWIDRFQ AGFEKGVADA 
MYQNGVDIIF HASGATGQGV FQEAKDLNES 
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE 
DKVISGDVKV PEKPE 



EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN 
QIDGKKNWS ATFRDNEAAY LAGVAAANET 
AKELGKEITV DTKYAASFAD PAKGKALAAA 
GSGDKVWVIG VDRDQDADGK YKTKDGKEDN 
HLVYGLKDGG VDLTDGYLND KTKEAVKTAK 



EF022-1 (SEQ ID NO:81) 

TAAGAGCATA AAAAAATGAA GAGTTATAGG 
ACAATGGTTT GTATTTTATT GGTAGGATTT 
AAAAAGAAAC AGAAAAATAC CAAAGAAGCC 
ACGCTCAACA CCTCTGTATT ATTGGATTTT 
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC 
CCGATGATTT CAGAAGATGG AAAAACCTAC 
AGTAACGATG ATCCTGTCAC AGCACATGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG 
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA 
GATAAAGTCG TCTATAATGG TCCGTTCGTG 
TGGCAACTAG CAAAAAATAA TCGCTATTGG 
AATTATACAG TTATCAAAGA AACATCTACC 
GATGTGGCTA CACTAAGTGG TGAACTGGCG 
TCGTATCCAA CAGCGACAAT GAACTATTTG 
ACGCCGCTTG CAAACGAAAA CCTGCGTAAA 
CTAGTCAATA ATATTATTGC AGATGGTTCT 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT 
TATAACAAAG AAAAAGCGCA AAGTTATTGG 
GTTAACGTTG AATTGATGGT AACAGATGAT 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT 
CCAGACTATC AAGACCCTAT TTCTACCCTG 
TATCAGAACC CTGTCTATGA CAAATTATTA 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA 



AGAAAGAAGA TGAAAAAGTA TTTAAAAATC 
TTAGCTGGGT GTACCAATAA AAATGAAAAT 
GTTCAACTGA TGTCACCCTC GGAATTAACA 
CCAGATGCTA TTGTCCAAAC TGCAGCGTTT 
CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
ACGATTTCTT TGAGAAAAGA AGCGGTTTGG 
TTTGAATATG CTTGGAAAAA AATGATTGAT 
ATCGTTGAAA CAATTCAAAA TGGTGCAGAA 
CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TACTTTACGT CCTTGTTAGC TTTTCCGACA 
CAATTTGGTG CGGACTATGG AACTGCTAGT 
GTAAAAGATT GGCAGCAAAC AAAGATGGAC 
GATCACCAGA ACGTGCGCTC AGACATTATC 
GCATTGAATC TTTTTGAAGA TGGACAATTA 
CAACAGAATA AAAATAATAC GTTGTATCAT 
CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTCGTCAAG AAGCAGGTAA TTTAATGGTT 
AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
TTGACAATAG AGCTAACCGC ATTGCCGACT 
GACTATGATT TATTCTTAAT TTACTGGACA 
ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
GATGAAGCAG CCACAACCTA TGCCTTAGAG 
GCTGAAAAAG AAGTGATTGA AACGACTGCT 
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GGCATGATTC CACTTAGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AGTCAAAGGC 
TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAATAA 



EF022-2 {SEQ ID NO:82) 

MKKYLKIT MVCILLVGFL AGCTNKNENK KKQKNTKEAV QLMSPSELTT 
LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP MISEDGKTYT ISLRKEAVWS 
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 
KVTLKEPKPY FTSLLAFPTF FPQNXKWEQ FGADYGTASD KWYNGPFW KDWQQTKMDW 
QLAKNNRYWD HQNVRSDI IN YTVIKETSTA LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
YPTATMNYLR LNQKRXGQAT PLANENLRKA LALGIDKENL VNNIIADGSK ALHGAITEGF 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
GSLQELFPGL TIELTALPTE AALNFGRESD YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
QNPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 
NFHTFGAPLT LKNVYKEK 



EF022-3 {SEQ ID NO: 83) 
GT GTACCAATAA AAATGAAAAT 

AAAAAGAAAC AGAAAAATAC CAAAGAAGCC GTTCAACTGA TGTCACCCTC GGAATTAACA 
ACGCTCAACA CCTCTGTATT ATTGGATTTT CCAGATGCTA TTGTCCAAAC TGCAGCGTTT 
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
CCGATGATTT CAGAAGATGG AAAAACCTAC ACGATTTCTT TGAGAAAAGA AGCGGTTTGG 
AGTAACGATG ATCCTGTCAC AGCACATGAT TTTGAATATG CTTGGAAAAA AATGATTGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC ATCGTTGAAA CAATTCAAAA TGGTGCAGAA 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG TACTTTACGT CCTTGTTAGC TTTTCCGACA 
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA CAATTTGGTG CGGACTATGG AACTGCTAGT 
GATAAAGTCG TCTATAATGG TCCGTTCGTG GTAAAAGATT GGCAGCAAAC AAAGATGGAC 
TGGCAACTAG CAAAAAATAA TCGCTATTGG GATCACCAGA ACGTGCGCTC AGACATTATC 
AATTATACAG TTATCAAAGA AACATCTACC GCATTGAATC TTTTTGAAGA TGGACAATTA 
GATGTGGCTA CACTAAGTGG TGAACTGGCG CAACAGAATA AAAATAATAC GTTGTATCAT 
TCGTATCCAA CAGCGACAAT GAACTATTTG CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
- ACGCCGCTTG CAAACGAAAA CCTGCGTAAA GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
CTAGTCAATA ATATTATTGC AGATGGTTCT AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT TTTCGTCAAG AAGCAGGTAA TTTAATGGTT 
TATAACAAAG AAAAAGCGCA AAGTTATTGG AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GTTAACGTTG AATTGATGGT AACAGATGAT GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT TTGACAATAG AGCTAACCGC ATTGCCGACT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT GACTATGATT TATTCTTAAT TTACTGGACA 
CCAGACTATC AAGACCCTAT TTCTACCCTG ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
TATCAGAACC CTGTCTATGA CAAATTATTA GATGAAGCAG CCACAACCTA TGCCTTAGAG 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA GCTGAAAAAG AAGTGATTGA AACGACTGCT 
GGCATGATTC CACTTAGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AGTCAAAGGC 
TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAA 



EF022-4 (SEQ ID NO: 84) 

CTNKNENK KKQKNTKEAV QLMSPSELTT 
LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ 
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI 



LVPAVAKALP MISEDGKTYT ISLRKEAVWS 
VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 
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KVTLKEPKPY FTSLLAFPTF FPQNXKWEQ FGADYGTASD KWYNGPFW KDWQQTKMDW 
QLAKNNRYWD HQNVRSDIIN YTVIKETSTA LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
YPTATMNYLR LNQKRXGQAT PLANENLRKA LALGIDKENL VNNIIADGSK ALHGAITEGF 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
GSLQELFPGL TIELTALPTE AALNFGRESD YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
QNPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 
NFHTFGAPLT LKNVYKEK 

EF023-1 (SEQ ID NO:85) 

TAAAATGGAG GGATCGGTAT GAAGAAATTA AAAATGTTAG GATGCGTCGG GTTGCTTTTA 

GCTTTAACGG CTTGTCAGGC GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA 

CAAAAAATTG CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 

GATACGACCT CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 

GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 

GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 

AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 

GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG 

AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 

ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 

ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 

TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 

ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 

CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 

GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 

ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 

AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 

TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA 

GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA 

GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 

TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 

CAATTGGAAG AAAATC TAG A TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTT TACT ATT 

CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT 

TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 

ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 

GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATC AGGAA 

ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 

GATCTGTATT GGCATTCATT TGGACCCACG' TACAGTTTAA AATGGGCTTA TGTTAACTAA 



EF023-2 (SEQ ID NO:86) 

MKKLK MLGCVGLLLA LTACQAGTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 
TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDWNKLSG EFIPGYVDNP AFLSIPQFVT 
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LYWHSFGPTY SLKWAYVN 
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EF023-3 (SEQ ID NO: 87) 

GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA 

CAAAAAATTG CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT * CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 
GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 
GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA 
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA 
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 
CAATTGGAAG AAAATCTAGA TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAAC 



EF023-4 (SEQ ID NO: 88) 

GTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 

TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDWNKLSG EFIPGYVDNP AFLSIPQFVT 
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LYWHSFGPTY SLKWAYVN 

EF024-1 (SEQ ID NO:89) 

TAATGGCCGT TTCGTCTACT AATAAAGAGG ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG TACTACCTTT TATTGCCTTA 
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAATAA 
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EF024-2 (SEQ ID NO: 90} . 

M KKVLPFIALV GLLLLSGCGT DMKKILTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 



EF024-3 (SEQ ID NO: 91) 
ATT GACTGCCGAT 

GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTC AG A GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAA 



EF024-4 (SEQ ID NO: 92) 



LTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 

EF025-1 (SEQ ID NO: 93) 

TGAATGAAAC ATATTAAAGG AATGTTGGTT TTTATCGGAT TATTTATTTT GGTTGGTTGT 
GCGCCAGATC AAGAGCCAAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG 
AAGCAAGTTA CCGTCACCAA TCAAACGACT TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
AATGACGAAC TGATTGCTAA TCAATTGACT TTTGATTCTC ATGAATACAC GTACGAAGTG 
GTTACAGGGG CCACACAAAC GACATTTGGA ACAACCCCAC CAGCAAAATA TACACCGGAA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT CAACCGCCTT TGGGATTAAT GACGGGTAAC 
TATTATAAAA ATGAAGGTGT ATTTACTGGC GGAAATTACG GCATTGTAGA GATTATTACG 
GAACCTGAAA CGCAAAGGAT TCTGAATGTT GAGTTTACAG AGTTTGCTAG TGATCCTTAT 
TATGATACAC GCTATTCGGG TGTCAACAAA CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
AGCAACACGC GTACAGACGA TACGTTAGTC ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
AAACAAATGC GTGACGAAAA TCGTGTTACA GGTAATTTTT ATACGGTACG CGGTTCATCA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
CCATCGAAAG AAACGTATAT CGGTTACGCA GAAGATTTAG GCAATGGCCT AATCGCTCGA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA AAACATGTCA GCTATGATGA ATACTTTTCA 
GATGAACAGG AAAAAATCAC AGAAACAGCC TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCACCAGG ATACAATAAA CAAACCAACA ATTCTTTTAT TCATTTTGTA G 



EF025-2 (SEQ ID NO: 94) 

MKHIKGMLVF IGLFILVGCA PDQEPTKQTT 
DELIANQLTF DSHEYTYEW TGATQTTFGT 
YKNEGVFTGG NYGIVEIITE PETQRILNVE 
NTRTDDTLVT WNGITYVEK QMRDENRVTG 
SKETYIGYAE DLGNGLIARL QVITEEQKIK 
HQDTINKPTI LLFIL 



SGPQETKQVK QVTVTNQTTS AVEKQAPTKN 
TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY 
FTEFASDPYY DTRYSGVNKR LSDYPEFQAS 
NFYTVRGSST SAREGLMPLA AEMDTWLKEP 
HVSYDEYFSD EQEKITETAC GLFIVNRNII 



EF025-3 (SEQ ID NO:95) 
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AAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG 

AAGCAAGTTA CCGTCACCAA TCAAACGACT TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
AATGACGAAC TGATTGCTAA TCAATTGACT TTTGATTCTC ATGAATACAC GTACGAAGTG 
GTTACAGGGG CCACACAAAC GACATTTGGA ACAACCCCAC CAGCAAAATA TACACCGGAA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT CAACCGCCTT TGGGATTAAT GACGGGTAAC 
TATTATAAAA ATGAAGGTGT ATTTACTGGC GGAAATTACG GCATTGTAGA GATTATTACG. 
GAACCTGAAA CGCAAAGGAT TCTGAATGTT GAGTTTACAG AGTTTGCTAG TGATCCTTAT 
TATGATACAC GCTATTCGGG TGTCAACAAA CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
AGCAACACGC GTACAGACGA TACGTTAGTC ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
AAACAAATGC GTGACGAAAA TCGTGTTACA GGTAATTTTT ATACGGTACG CGGTTCATCA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
CCATCGAAAG AAACGTATAT CGGTTACGCA GAAGATTTAG GCAATGGCCT AATCGCTCGA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA AAACATGTCA GCTATGATGA ATAC TTTTC A 
GATGAACAGG AAAAAATCAC AGAAACAGCC TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCACCAGG ATACAATAAA CAAACCAACA ATTCTTTTAT TCATTTTG 

EF025-4 (SEQ ID NO: 96) 

TKQTT SGPQETKQVK QVTVTNQTTS AVEKQAPTKN 

DELIANQLTF DSHEYTYEW TGATQTTFGT TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY 
YKNEGVFTGG NYGIVEIITE PETQRILNVE FTEFASDPYY DTRYSGVNKR LSDYPEFQAS 
NTRTDDTLVT WNGITYVEK QMRDENRVTG NFYTVRGSST SAREGLMPLA AEMDTWLKEP 
SKETYIGYAE DLGNGLIARL QVITEEQKIK HVSYDEYFSD EQEKITETAC GLFIVNRNII 
HQDTINKPTI LLFIL 



EF026-1 (SEQ ID NO: 97) 



TGAGTGTATG ATTACTCATT 
TTTTTTGGAG GGATTTTCAT 
GCTGCTCTTG TGTTGCTTAG 
AGCAAAGAAA CAGCTAATTC 
CCTGAAGAGC TCGAAATGGC 
AATGTCACTT TTGATAAAGA 
TCCAACGATG TCTATCGTAA 
GCTGAATATG AAATCACGGT 
CACGGGACTG TCAAAGGTGA 
AAAGTTAATG GCAATATTAC 
AAAGATGGTG CCACTGTTAC 



TCCCTTTGAA TCAGTTATGA 
GAAAATGTCT AAAGTACTCA 
TGCTTGTTCA TCTGATAAAA 
AAGTACAGAA GTAGTCTCTG 
GTTAAGTGAT AAAGGAAATT 
GGTAACAGTT GCTGGTACTT 
ATTAGCACTT TATTCCCAAG 
TCCTAAGCTA ATCGTTTCTT 
TATTGAGGTG AAAGCAAATG 
TTTTGATAAA CAAGAATACA 
TGGTGAAGTC ACCGTAGCCA 



TAAAGGAAGA AATAAATAAA 
CCACTGTTTT GACGGCAACT 
AAACAGATAG TAGTTCTAGT 
GTGCTTCAAT TAGTGCCAAG 
GGATTGTCGC AGCTACTGAC 
TCCATGATAA GGGGAAAGAT 
ATGATAATAA AAAAGTAACT 
CTGAAAATTT CAACATCGTT 
GCTTTACTTT AAATGGTACC 
AAGATTCTGC TGACTTAGAA 
ATAATTAA 



EF026-2 (SEQ ID NO: 98) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 



EF026-3 (SEQ ID NO: 99) 



AACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT 
AATGTCACTT TTGATAAAGA GGTAACAGTT 



GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
AAAGGAAATT GGATTGTCGC AGCTACTGAC 
GCTGGTACTT TCCATGATAA GGGGAAAGAT 
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TCCAACGATG TCTATCGTAA ATTAGCACTT 
GCTGAATATG AAATCACGGT TCCTAAGCTA 
CACGGGACTG TCAAAGGTGA TATTGAGGTG 
AAAGTTAATG GCAATATTAC TTTTGATAAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC 



TATTCCCAAG ATGATAATAA AAAAGTAACT 
ATCGTTTCTT CTGAAAATTT CAACATCGTT 
AAAGCAAATG GCTTTACTTT AAATGGTACC 
CAAGAATACA AAGATTCTGC TGACTTAGAA 
ACCGTAGCCA ATAAT 



EF026-4 (SEQ ID NO:100) 
TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 



EF027-1 (SEQ ID NO: 101) 



TTTGGTATGA AAC AGAAAAA GTGGTTAATC 
GCATGTGGAA GTGGCGGTTC GAAAACGACC 
GTCGCATCTG GTGGTGAACT CTCGACATTA 
TCCGATATGA TTGGTCAAGT AGTTGAAGGC 
GAGCTAGCTA TGGCGAAAGC AGAGCCACAA 
AAGTTACGAG AAGCAAAATG GACAAACGGG 
GCGTTTAGAA ACGTGGTCGA TCCAGCATAC 
TTTAAAAATG GGCGTGCGGT GCGGGAAGGA 
GCAATCGATG ACCAGACACT AGAACTAACA 
GTCTTGGTTG GGACACCTTT TATGCCTAAA 
GCCTATGGGA CTTCTGCAGA TAATTTTGTT 
GATGGCAATT CCGAAACTTG GAAATTGAAG 
GTAAAATTGA ATGAAATTGA TGTTCAAGTA 
TTTGATAATG GCGACTTAGA TTACACTGTT 
GAGTCAAAAC AAGCGCATTT TGTACCTAAA 
CGCCGTGAAA TTACCGGCAA CGAACATGTT 
GAAACTTTTG CAAAAGAAAT TTTAGGAGAT 
GCTAATTTTG CAAAAATCCA GATACAGGTG 
TGCCATATAA TATTAAAGAA GCCCAAGCTA 



GGACTTGTTG CACTGGGCTT GGTTTTAGCA 
TCAAACGAAC CAGCTACACA GAAAATTAAC 
GACAGCGCTC ATTATACAGA TGTCTATAGT 
TTGTATCGAC AAGATAAAAA CGGAGATCCT 
GTTAGTGAAG ACGGGTTAGT CTATACATTC 
GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 
GGTTCAAGTA GCAGTAATCA AATGGATATT 
CAAGCCACGA TGGAAGAATT TGGTGTCAAA 
TTGGAAAATC CAATTCCTTA TTTAGCCCAA 
AATGAAGCCT TTGCCAAAGA AAAAGGTACT 
GGCAATGGGC CGTTTGTAAT TTCAGGTTGG 
AAGAATGATC ATTATTGGGA TAAAGAACAC 
GTGAAAGAAA TTGGCACAGG AGCCAATCTT 
TTAGCAGATA CTTATGCACT TCAGTATAAA 
GCCATGGTGG GTTATTTAAG CCCCAATCAT 
CGAAAAGCTT TTTTACAAGC GATTGACAAA 
GGCTCGACAG CTTTAAATGG NTTTGTACCA 
AAGATTTCCG CAAAGAAAAT GGTGATTTAT 
ACTGGAACAA TT 



EF027-2 (SEQ ID NO: 102) 

MKQKKWLI GLVALGLVLA ACGSGGSKTT SNEPATQKIN VASGGELSTL DSAHYTDVYS 
SDMIGQWEG LYRQDKNGDP ELAMAKAEPQ VSEDGLVYTF KLREAKWTNG DPVKAGDFW 
AFRNWDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENPIPYLAQ 
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWKLK KNDHYWDKEH 
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAHFVPK AMVGYLSPNH 
RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAKKMVIY 
CHIILKKPKL TGTI 



EF027-3 (SEQ ID NO: 103) 



AACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC 

GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC ATTATACAGA TGTCTATAGT 
TCCGATATGA TTGGTCAAGT AGTTGAAGGC TTGTATCGAC AAGATAAAAA CGGAGATCCT 
GAGCTAGCTA TGGCGAAAGC AGAGCCACAA GTTAGTGAAG ACGGGTTAGT CTATACATTC 
AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 
GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA GCAGTAATCA AATGGATATT 
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TTTAAAAATG GGCGTGCGGT GCGGGAAGGA 
GCAATCGATG ACCAGACACT AGAACTAACA 
GTCTTGGTTG GGACACCTTT TATGCCTAAA 
GCCTATGGGA CTTCTGCAGA TAATTTTGTT 
GATGGCAATT CCGAAACTTG GAAATTGAAG 
GTAAAATTGA ATGAAATTGA TGTTCAAGTA 
TTTGATAATG GCGACTTAGA TTACACTGTT 
GAGTCAAAAC AAGCGCATTT TGTACCTAAA 
CGCCGTGAAA TTACCGGCAA CGAACATGTT 
GAAACTTTTG CAAAAGAAAT TTTAGGAGAT 
GCTAATTTTG CAAAAATCCA GATACAGGTG 
TGCCATATAA TATTAAAGAA GCCCAAGCTA 



CAAGCCACGA TGGAAGAATT TGGTGTCAAA 
TTGGAAAATC CAATTCCTTA TTTAGCCCAA 
AATGAAGCCT TTGCCAAAGA AAAAGGTACT 
GGCAATGGGC CGTTTGTAAT TTCAGGTTGG 
AAGAATGATC ATTATTGGGA TAAAGAACAC 
GTGAAAGAAA TTGGCACAGG AGCCAATCTT 
TTAGCAGATA CTTATGCACT TCAGTATAAA 
GCCATGGTGG GTTATTTAAG CCCCAATCAT 
CGAAAAGCTT TTTTACAAGC GATTGACAAA 
GGCTCGACAG CTTTAAATGG NTTTGTACCA 
AAGATTTCCG CAAAGAAAAT GGTGATTTAT 
A 



EF027-4 (SEQ ID NO:104) 



TT SNEPATQKIN VASGGELSTL DSAHYTDVYS 

SDMIGQWEG LYRQDKNGDP ELAMAKAEPQ VSEDGLVYTF KLREAKWTNG DPVKAGDFW 
AFRNWDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENPIPYLAQ 
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWKLK KNDHYWDKEH 
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAHFVPK AMVGYLSPNH 
RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAKKMVIY 
CHIILKKPKL 



EF028-1 (SEQ ID NO:105) 

TAACAGAAGC AATACAACAA CTTAACACTT 
AAGACTTGTT ATAGTCAATG TATGGGTAGA 
AGAGCTTTGC TAGGGGTTAC CTTATTAACA 
TCTGAACAGA AAAGCGGCGA AAAACAAACA 
GAAAAAGCAT CAGTAAAAAA TGTTATTTTT 
ACAACGGGCT ATCGCTATTT CAAAGCCAAT 
TTTGATACCT ATTTGGTCGG ACAGCAAGCC 
ACCGATTCAG CTTCCGCAGC GACAGCGATG 
ATTGCACTCG ATAATGACAA GTCCAAAACA 
GGGAAATCAA CGGGTCTTGT AGCAACATCT 
GGCGCACATA ATGTTTCACG CAAAAATATG 
CAAATCGACG GACAACACAA AGTCGATGTG 
CGGAAAGATC GTGATTTAGT CAAAGAATTT 
AAAAAGTCGT TAAATGAGAA CCAAGACGAC 
CTACCTAAAA TGATTGACCG AACGGAAGAA 
GCTCTTCAAC GGTTAGATAA AAATGAAAAA 
ATTGATTGGG CCGGGCATAG CAATGATATT 
GAAGCGGCGT TTGAAAAGGC CATCGATTTT 
CAACTGCAGA TCATTCAACA GGGGGCTTGT 



TGTTTACTTG TTATTTATCA GAAATCAACT 
TATGAAGGAG GAAACAAGGA AATGAAGAAA 
TTCACAACAT TAGCGGGTTG TACAAATTTA 
GAGGTTGCTG AAGCGAAGGC AACTGAATCT 
ATGATTGGAG ATGGCATGGG GAATCCGTAT 
CACTCAGACA AGCGTGTTCC CCAAACAGCT 
ACTTATCCAG AAGATGAAGA AGAGAATGTC 
GCTGCCGGAG TGAAAACCTA TAATAATGCT 
GAAACAGTGC TCGAACGTGC GAAAAAAGTG 
GAAATAACAC ATGCAACCCC TGCTGCATAT 
GCAGAAATCG CCGATGACTA TTTTGATGAT 
TTACTTGGCG GCGGCTCCGA ATTATTTGCC 
TCCCAAGCGG GTTATGGTCA TGTCACAGAC 
AAAATTTTAG GCTTGTTTGC ACCAGGCGGG 
GTCCCTTCAT TAGCTGATAT GACAGAAGCG 
GGTTTCTTTT TAATGGTTGA AGGTAGTCAA 
GTTGGCGCGA TGAGCGAAAT GCAAGACTTC 
GCCAAAAAAG ATGGTGAACA TTGGTGGTTA 
CTTTAG 



EF028-2 (SEQ ID NO: 106) 



MKKR ALLGVTLLTF TTLAGCTNLS 
EQKSGEKQTE VAEAKATESE KASVKNVIFM 
DTYLVGQQAT YPEDEEENVT DSASAATAMA 
KSTGLVATSE ITHATPAAYG AHNVSRKNMA 
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV 



IGDGMGNPYT TGYRYFKANH SDKRVPQTAF 
AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
ILGLFAPGGL, PKMIDRTEEV PSLADMTEAA 
GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 
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LQIIQQGACL 

EF028-3 (SEQ ID NO: 107), 

ACAGA AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT 
GAAAAAGCAT CAGTAAAAAA TGTTATTTTT ATGATTGGAG ATGGCATGGG GAATCCGTAT 
ACAACGGGCT ATCGCTATTT CAAAGCCAAT CACTCAGACA AGCGTGTTCC CCAAACAGCT 
TTTGATACCT ATTTGGTCGG ACAGCAAGCC ACTTATCCAG AAGATGAAGA AGAGAATGTC 
ACCGATTCAG CTTCCGCAGC GACAGCGATG GCTGCCGGAG TGAAAACCTA TAATAATGCT 
ATTGCACTCG ATAATGACAA GTCCAAAACA GAAACAGTGC TCGAACGTGC GAAAAAAGTG 
GGGAAATCAA CGGGTCTTGT AGCAACATCT GAAATAACAC ATGCAACCCC TGCTGCATAT 
GGCGCACATA ATGTTTCACG CAAAAATATG GCAGAAATCG CCGATGACTA TTTTGATGAT 
CAAATCGACG GACAACACAA AGTCGATGTG TTACTTGGCG GCGGCTCCGA ATTATTTGCC 
CGGAAAGATC GTGATTTAGT CAAAGAATTT TCCCAAGCGG GTTATGGTCA TGTCACAGAC 
AAAAAGTCGT TAAATGAGAA CCAAGACGAC AAAATTTTAG GCTTGTTTGC ACCAGGCGGG 
CTACCTAAAA TGATTGACCG AACGGAAGAA GTCCCTTCAT TAGCTGATAT GACAGAAGCG 
GCTCTTCAAC GGTTAGATAA AAATGAAAAA GGTTTCTTTT TAATGGTTGA AGGTAGTCAA 
ATTGATTGGG CCGGGCATAG CAATGATATT GTTGGCGCGA TGAGCGAAAT GCAAGACTTC 
GAAGCGGCGT TTGAAAAGGC CATCGATTTT GCCAAAAAAG ATGGTGAACA TTGGTGGTTA 
CAACTGCAGA TCATTCAACA GGGGGCTTGT CTT 

EF028-4 (SEQ ID NO: 108) 

QKSGEKQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF 
DTYLVGQQAT YPEDEEENVT DSASAATAMA AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
KSTGLVATSE ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK ILGLFAPGGL PKMIDRTEEV PSLADMTEAA 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 
LQIIQQGACL 

EF029-1 (SEQ ID NO: 109) 

TGAAGGAGGG AGAAAATGAA AAAGTTAATC GGTAAAAAGT GGCTGCTGCT TACAGCAGTA 
GCCACTTTTT TATTATCAGG ATGCGCAAGT CTTGAACAAA AAGCACAGGA TAGTGTAAAA 
GAAGTTACTG AAAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 
GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA ACAAAAAGTT 
CGGTTTTTAT TAATTGACAC ACCCGAGACT GTGAAACCGA AAACAAAAGT TCAGCCGTTC 
GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA CTGCTTCAGA AATTACGTTT 
GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG CGTTGGGCTA CATATTCGTA 
GATGGAACAT TACTACAAAA AACGCTTGTA AGTGAAGGAT TAGCTCGTGT TGCCTATGTA 
AAAGAGCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG CCCAAGAACA GGCTAAAAAT 
GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC AACGGGGGTT TAGTAAATAA 

EF029-2 (SEQ ID NO: 110) 

MKKLIG KKWLLLTAVA TFLLSGCASL EQKAQDSVKE VTENVTQTIS NDQRIPADFV 
RHVDGDTTVL KIDGKEQKVR FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE 
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 
SLGIWSIPGY VTQRGFSK 

EF029-3 (SEQ ID NO: 111) 

AAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 

GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA ACAAAAAGTT 
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CGGTTTTTAT TAATTGACAC 
GGATTGGAAG CTAGCAAACG 
GAATATGATA AGGGCGATAA 
GATGGAACAT TACTACAAAA 
AAAGAGCCTA CAACTAAGTA 
GAGTCACTCG GAATCTGGAG 



ACCCGAGACT GTGAAACCGA 
CACAAAAGAG CTTTTGTCTA 
AACAGATCGT TACGGACGAG 
AACGCTTGTA AGTGAAGGAT 
TTTGGCAGAA CTAGAGCAAG 
CATACCAGGT TATGTGACAC 



AAACAAAAGT TCAGCCGTTC 
CTGCTTCAGA AATTACGTTT 
CGTTGGGCTA CATATTCGTA 
TAGCTCGTGT TGCCTATGTA 
CCCAAGAACA GGCTAAAAAT 
AACGGGGGTT TAGTAAA 



EF029-4 (SEQ ID NO: 112) 



NVTQTIS NDQRIPADFV 

RHVDGDTTVL KIDGKEQKVR FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE 
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 
SLGIWSIPGY VTQRGFSK 



EF030-1 (SEQ ID NO:113) 



TGATTGACAC ATAGGGGGAA TAGTATGAAA 
GTTAGTACGG TCTTGGTAGG TTGTGGCACA 
ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT 
TCAATGGATT CTATTTTTAC ACAAGATGAA 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT 
CCAGAAATTT CTGAGGATGG CAAACGATAT 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA 
GAAATGATTG TATCTAATGG TCCGTTTTTA 
TGGGATTTTG TGCGTAATCC CTACTATTAC 
CATTTTGAAG TTCTTAAAGA AACCAATACC 
GATGTGGCTG TCTTAACAGG AGATTTTGCT 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA 
TTAGTCGATA ATATTTTAGC AGATGGCTCA 
TTTGTATATA ACCCAGAAAC GAATGAAGAT 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA 
GTAGCCATTG AACTTCTTTC AAGAGATGGT 



AAGTTAAAAA TGATGGGGAT TATGTTATTT 
ACAGCAGANA 1 CAAAAATAGA CGAGAAAGCA 
TTAAATTTAA TGGAGAACTC GGAAATCGGT 
GCCAGTATTA ACGCACAGTC CAATGTCTTT 
CAACTAATAC CTGCTGCTGC TAAAGAGATG 
ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
TTGGGTGTCA AAGCGCTTGA TG ATT AT ACT 
TATTTTACGT CGTTATTGGC ATTTTCTCCA 
GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
ATGAAAAATT GGGATCAGTC AGCGATGTCG 
GATAAAGAAA AAGTAAAATC AGAAACGATT 
GTTTATAATT TGTACGAATC AGGTGAATTA 
AAACAAAATC GAGACAACCC AGACTATGAA 
CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT 
AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GATAGTGACC GA 



EF030-2 (SEQ ID NO: 114) 



MKK LKMMGIMLFV STVLVGCGTT AXTKIDEKAT EKTSVSKKVL NLMENSEIGS 
MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS 
NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL 
EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 
DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA 
IERSKVYSLR LNQKRNEKPS IFANENVRKA LAYALDKKSL VDNILADGSK EIYGYIPEKF 
VYNPETNEDF RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDGD SDR 



EF030-3 (SEQ ID NO:115) 
GAGAAAGCA 

ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT 
TCAATGGATT CTATTTTTAC ACAAGATGAA 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT 



TTAAATTTAA TGGAGAACTC GGAAATCGGT 
GCCAGTATTA ACGCACAGTC CAATGTCTTT 
CAACTAATAC CTGCTGCTGC TAAAGAGATG 
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CCAGAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG 
TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 
CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 
GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT 
TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTGTATATA ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT 

EF030-4 (SEQ ID NO: 116) 



EKAT EKTSVSKKVL NLMENSEIGS 
MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ 
NGDAVTANDF VFAWRKLANP KNQANYFFLL 
EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE 
DFVRNPYYYD KEKVKSETIH FEVLKETNTV 
IERSKVYSLR LNQKRNEKPS IFANENVRKA 
VYNPETNEDF RQEAGALVKT DAKKAKEYLD 

EF031-1 (SEQ ID NO: 117) 



LIPAAAKEMP EISEDGKRYT IKLREDGKWS 
EGTILNGTAI TKEEKAPEEL GVKALDDYTL 
KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 
YNLYESGELD VAVLTGDFAK QNRDNPDYEA 
LAYALDKKSL VDNILADGSK EIYGYIPEKF 
KAKAELNGDV AIELLSRDG 



TGAGAAATTA GTTATTTTAG AAAAATAAAA ACCATTTTGG AGGAAGATTT AAAAATGAAA 
AAACGCGTAA TTTTAGGGAC ATTAGTCGCT GCAACGTTAT TAATGACTGC TTGTGGAAAC 
AGCGAAGCAA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT 
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG 
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC 
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAACG TGA 

EFO031-2 (SEQ ID NO:118) 

MKK RVILGTLVAA TLLMTACGNS EATTKSESKG GSNALWSTF 

GLSEDIVKKD IIAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK 
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKlrGK EIKNWDDLWS 
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN WKTYSKSSD 
LANMFQSGEI EAAWADFAV DIIQGAQKT 



EF031-3 (SEQ ID NO: 119) 
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AA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT 
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG 
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC 
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAA 

EF031-4 <SEQ ID NO:120) 

TTKSESKG GSNALWSTF 

GLSEDIVKKD I I APFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK 
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS 
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN WKTYSKSSD 
LANMFQSGEI EAAWADFAV DIIQGAQK 



EF032-1 {SEQ ID NO:121) 

TGAATAAATT ATTTAGGAGG AATTATGATG AAAAAATTAA TTAGTTTAGG ATTGGTTTGT 
GTTTGTGGTA TTTCACTACT TACTGCTTGT NCGGGAAATA ATGATAATAA AGATACTGAA 
AAGTCAACCA GTCAATCTAG CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
TCAGGGGAAT ATTCAGTTGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT 
CAACTAGATG ATAAATCGAG CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT 
AACCATGACT TATACGGAGT GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT 
CTCACATTCG AAACTGCCGA CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
CAAGAATATA TGAAAAATCC AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC 
TCTGATGTTT CTAAAAGTAG TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA 
GAAGTAAGTA CTGAAGCGAA GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA 
AATACTAATG ACATTACTAA GCTAGCAGAT GAGCCAACCT TAGAACAACA AACCGTCTTA 
GATACTTTAG CTAAGCATCA ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
TCAATTATCG GCGTCATCCC AACCATGGAC GCAAAAAGAT GGTAA 



EF032-2 (SEQ ID NO:122) 

MK KLISLGLVCV CGISLLTACX GNNDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS 
IIGVIPTMDA KRW 



EF032-3 (SEQ ID NO: 123) 
TA ATGATAATAA AGATACTGAA 

AAGTCAACCA GTCAATCTAG CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
TCAGGGGAAT ATTCAGTTGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT 



WO 98/50554 



PCT/US98/08959 



113 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

CAACTAGATG ATAAATCGAG CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT 
AACCATGACT TATACGGAGT GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT 
CTCACATTCG AAACTGCCGA CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
CAAGAATATA TGAAAAATCC AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC 
TCTGATGTTT CTAAAAGTAG TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA 
GAAGTAAGTA CTGAAGCGAA GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA 
AATACTAATG ACATTACTAA GCTAGCAGAT GAGCCAACCT TAGAACAACA AACCGTCTTA 
GATACTTTAG CTAAGCATCA ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
TCAATTATCG GCGTCATCCC AACCATGGAC GCAAAAAGAT GG 



EF032-4 (SEQ ID NO : 124) 

NDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE 
IIGVIPTMDA KRW 



TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS 



EF033-1 (SEQ ID NO:125) 

TGACTGCTTT TTTTCTATTG GAGAAAAAAG TGGTTTTTTT GTATTGTTTT GACGTTGAGA 
CAAAGGAGGT TCATTTCAGA AAATTTTCCC CAAAATAAAA TAGACGAATG CGAGGATGAA 
AAAATGAAAA AATTTACTTT AACAATGATG ACTTTAGGTT TAGTAGCAAC ACTTGGCTTA 
GCAGGATGTG GTAAACAGGA AAAGAAAGCA ACTACCTCTT CTGAAAAAAC AGAAGTAACG 
TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT 
ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA 
ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG 
GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG 
AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 
ACAGTCGTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA 
GAAATCGACG AGTAG 



EF033-2 (SEQ ID NO:126) 
MKKFTLTMMT LGLVATLGLA 

GCGKQEKKAT TSSEKTEVTL PTKDRSGKEI TL?KEATKII SLVPSTTEVI EDLGKTDQLI 
AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT 
WNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE 

EF033-3 (SEQ ID NO: 127) 

CTCTT CTGAAAAAAC AGAAGTAACG 

TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT 
ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA 
ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG 
GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG 
AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 
ACAGTCGTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA 
GAAATCGACG AGTAG 
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EF033-4 (SEQ ID NO: 128) 

SSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI 

AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT 

WNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE 

EF034-1 (SEQ ID NO: 129) 

TAGGAGGGAG TAATCATGAA AAAAATCGGG TATTTTAGTT GTATTATTTT TTTCATGTTT 
TTGGTAGGTT GTAGTAATAA CAAAAAAGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT 
CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 
TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT TAGTTAATCC CAATAATGAA 
GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 
AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 
AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-2 (SEQ ID NO:130) 

MKKIGY FSCIIFFMFL VGCSNNKKEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF 
nkdkimtlek EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 

EF034-3 (SEQ ID NO: 131) 

AGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT 

CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 
TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT TAGTTAATCC CAATAATGAA 
GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 
AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 
AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-4 (SEQ ID NO: 132) 

KEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF 

NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 



EF035-1 (SEQ ID NO:133) 

TAAACGAGAG GTGAGTTTAT GAAAACAAAA 
TTATTCACAA GTTTCCTTTT ACTGAGTGGT 
ACAATTGATC GACAGAAAGA AAAAGTCGAT 
GAAAATTCCA TGGAAAGTTA CGACGAAAAA 
AAAATCGATA CTACTGAGTA A 

EF035-2 (SEQ ID NO:134) 



ATCGGAAAAA CAGTTATCTT GTCAGCATTT 
TGTACCTCGG CTGGCGAAGA GATGGAAAAA 
AAAACGGTCG ATAAGCAGAA ACATAAAAAT 
GTTGACCGTT CTTTAGATAG TCAAGAAGAC 



MKTKI GKTVILSAFL FTSFLLLSGC TSAGEEMEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 
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EF035-3 (SEQ ID NO: 135) 
GATGGAAAAA 

ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA ACATAAAAAT 
GAAAATTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG TCAAGAAGAC 
AAAATCGATA CTACTGAG 

EF035-4 (SEQ ID NO: 136) 



MEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 



EF036-1 (SEQ ID NO:137) 

TAATTTTCAA GTCCTACATA TAATGGTAAA ATAGAATGGA TTGAAATTAA TTGGAGGAAT 
AATGAATCGA TGAAAAAAAG ATTGCTATTA TTTATTGGTT TGGCAAGTAT ACTTACTTTG 
ACAGGATGTG CAAAATGGAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 
TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 
GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT 
AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA 
GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT 
TGGCAAGGAA ATGTCATTAA ATAA 

EF-36-2 (SEQ ID NO: 138) 

MKKRLLLF IGLASILTLT GCAKWIDRGE SITAVGSSAL 

QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI 
DHKVAWGIT PIVNKNVGVK DISMENLKKI FLGEVTNWKE LGGKDQKIVI LNRAAGSGTR 
ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA FSYVTDEVAT LSIDGVQPTD 
ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP VSKMEIERDW 
QGNVIK 

EF036-3 (SEQ ID NO: 139) 

GAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 

TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 
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GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT 
AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA 
GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT 
TGGCAAGGAA ATGTCATTAA A 

EF036-4 (SEQ ID NO: 140) 

IDRGE SITAVGSSAL 

QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI 
DHKVAWGIT PIVNKNVGVK DISMENLKKI FLGEVTNWKE LGGKDQKIVI LNRAAGSGTR 
ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA FSYVTDEVAT LSIDGVQPTD 
ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP VSKMEIERDW 
QGNVIK 



EF037-1 (SEQ ID NO: 141) 

TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA 
TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT 
GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAA 

EF037-2 (SEQ ID NO:142) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 

EF037-3 (SEQ ID NO: 143) 

AACAGATAG TAGTTCTAGT 

AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA A 



EF037-4 (SEQ ID NO: 144) 
TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI 



GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
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EF038-1 (SEQ ID NO: 145) 

TAATGGCCAT TTCGTCTACT AATAAAGAGG ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG TACTACCTTT TATTGCCTTA 
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT TTTTGATGAT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACGTACTC TTACGATGAA 
AATAATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT GGAAAATGTT 
GAATATAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC TGATAAAAAA 
TNGAATAAGA GGTGTCTTTG A 

EF038-2 (SEQ ID NO: 146) 

M KLLKWRWQWN KDHKKGEVSM KKVLPFIALV GLLLLSGCGT DMKKILTADG 
GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 
YKDGKLKGEI GGEKDSDKKX NKRCL 

EF038-3 (SEQ ID NO: 147) 

TTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACGTACTC 
AATAATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT 
GAATATAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC 
TNGAATAAGA GGTGTCTTTG A 

EF038-4 (SEQ ID NO: 148) 

CGT DMKKILTADG 

GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 
YKDGKLKGEI GGEKDSDKKX NKRCL 



EF039-1 (SEQ ID NO:149) 

TAAATATATC AAAAAGAAAA AAGGGGATTA CCAACCATGA AAAAGAAAAA AGTTTTTAGT 
GCGCTTACCT TATTAACCTT TAGTACGTTG TTGATTGCAG GCTGTGCTGG CGGAGCCAAC 
TCTGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG 
CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC 
TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA 
GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
ACAGAAACTG GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA 
TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG 
TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGGCAAAGA AATTAAAGTC 



TTTTGATGAT 
TTACGATGAA 
GGAAAATGTT 
TGATAAAAAA 
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TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 
EF039-2 (SEQ ID NO: 150) 

MKKKKVFSA LTLLTFSTLL IAGCAGGANS ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANFI GLAKYDGETG FYEFFDKETG 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF 
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVS IGDNKYG AALELTELND 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF039-3 (SEQ ID NO:151) 

TGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG 
CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC 
TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GAGGAAAGAT 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA 
GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
ACAGAAACTG GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA 
TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG 
TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGGCAAAGA AATTAAAGTC 
TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 

EF039-4 (SEQ ID NO: 152) 

ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANFI GLAKYDGETG FYEFFDKETG 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF 
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVS IGDNKYG AALELTELND 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF040-1 (SEQ ID NO: 153) 

TAGATTAGAA CCACTGGAGA AAAATCTCAT ATTTCTCTCG AGGAAAGGAA GTTGAGCACA 
ATGAACAAAA AAATTTTAAT GGGGCTATTA AGTGTCGTGA CCATTCCATT ACTTGCTGCG 
TGTCAAGGAG GAGAAACACC TTCCGCAGCG TCAAAAAATA GTCAAACGGT GACTACTCAA 
AGTAGTGCAA AAACTGAAAG CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA 
GAGGAAGTGA AAGAACCGAT GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT 
CAACGAGATA CGAAGAAGGT CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA 
GTCGACACAG AAGGCAACAC ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT 
GCAAAAGCGT TGATTGATCG GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT 
CCCTATCTTT ATGCGGGAGC GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT 
GCGACCCCAG ATTTAAATAA GCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT 
GAAAAAGGAC ATATTGACAA TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT 
TTCCAAAATG ACTTTGGCTA TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC 
CAACTTTACC AAGATATTGT AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA 
GACAATTCTG GTCGAACAGC AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT 
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AAAATTTTAG CACAGTACAA CTAA 

i 

EF040-2 (SEQ ID NO: 154) 

M NKKILMGLLS WTIPLLAAC QGGETPSAAS KNSQTVTTQS 

SAKTESTSTT RSVAQTTSKE EVKEPMKTYE VGALLEAANQ RDTKKVKEIL QDTTYQVDEV 
DTEGNTPLNI AVHNNDIEIA KALIDRGADI NLQNSISDSP YLYAGAQGRT EILAYMLKHA 
TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI EAVGLREGNQ 
LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN 



EF040-3 (SEQ ID NO: 155) 



AGCG TCAAAAAATA GTCAAJ 
AGTAGTGCAA AAACTGAAAG 
GAGGAAGTGA AAGAACCGAT 
CAACGAGATA CGAAGAAGGT 
GTCGACACAG AAGGCAACAC 
GCAAAAGCGT TGATTGATCG 
CCCTATCTTT ATGCGGGAGC 
GCGACCCCAG ATTTAAATAA 
GAAAAAGGAC ATATTGACAA 
TTCCAAAATG ACTTTGGCTA 
CAACTTTACC AAGATATTGT 
GACAATTCTG GTCGAACAGC 
AAAATTTTAG CACAGTACAA 



^CGGT GACTACTCAA 
CACCAGTACA ACCCGTTCGG 
GAAGACCTAT GAAGTGGGTG 
CAAGGAAATT TTACAAGATA 
ACCGCTCAAT ATCGCTGTTC 
GGGTGCCGAT ATTAATCTGC 
GCAAGGACGT ACGGAGATTT 
GCATAACCGT TACGGTGGCA 
TGTGAAGCTC TTGTTAGAAG 
TACAGCATTG ATTGAGGCAG 
AAAATTGTTA ATGGAAAATG 
AATGGACTAT GCCAATCAAA 
C 



TAGCTCAAAC AACATCAAAA 
CGCTTTTAGA AGCAGCCAAT 
CTACTTATCA AGTGGATGAA 
ACAATAATGA CATTGAGATT 
AAAACAGCAT TAGTGATAGT 
TAGCGTATAT GTTAAAACAT 
ATGCGTTAAT TCCGGCAGCT 
ATGGACGAGA AGACATAGAT 
TGGGGTTACG TGAAGGGAAC 
GTGCGGATCA ATCCATTAAA 
AAGGTTATAC GGAAATTAGT 



EF040-4 (SEQ ID NO: 156) 



AS KNSQTVTTQS 
SAKTESTSTT RSVAQTTSKE 
DTEGNTPLNI AVHNNDIEIA 
TPDLNKHNRY GGNALIPAAE 
LYQDIVKLLM ENGADQSIKD 



EVKEPMKTYE VGALLEAANQ 
KALIDRGADI NLQNSISDSP 
KGHIDNVKLL LEDGREDIDF 
NSGRTAMDYA NQKGYTEISK 



RDTKKVKEIL QDTTYQVDEV 
YLYAGAQGRT EILAYMLKHA 
QNDFGYTALI EAVGLREGNQ 
ILAQYN 



EF041-1 (SEQ ID NO: 157) 

TAATTATTAA NTTCTGATTT TTCAGAAAAT 
ATGAAATTGA AAAAGTCATT AACATTCGGT 
GCGGCTTGTG GAGGCGGCGG AACGTCAGAT 
AGTGGCGAAC AAGTTTTACG TGTCACAGAA 
CTAGCAACAG NCAGAATTAG TTTTATTGCA 
TTAGACAAAG ATAACAAAGT CCAACCTGCA 
GATGGACTAA CATACAAAAT TAAATTAAAT 
GTGACTGCTA ATGACTATGT TTACGGATGG 
GAATATGCTT ATCTGTATGC CTCTGTAAAA 
GATAAATCAG AATTAGGAAT TAAAGCAGTC 
AAAGCAACAC CATACTTTGA TTACTTATTA 
GACATTGTGG AAAAATATGG TAAAAATTAT 
GGTCCATTCG TCTTAGACGG CTTTGATGGT 
AAAAACGATC AATATTGGGA TAAAGATACT 
GTGAAAGAAT CACCAACCGC GTTGAACTTG 
CTTTCTGGTG AATTAGCCCA ACAAATGGCC 
GCATCAACAC AATATATGGA ACTAAATCAA 



ACAGATTGCA TTATTTTAGG AGGCAACACT 
GTGATTACAT TATTTAGCGT AACAACTTTA 
AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 
CAACAAGAAA TGCCAACAGC TGATTTATCA 
TTAAATAATG TATATGAAGG AATTTATCGT 
GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA 
AAAGATGCAA AATGGTCAGA CGGTAAACCA 
CAACGAACAG TTGATCCAGC GACAGCTTCT 
AATGGTGATG CCATTGCTAA AGGGGAAAAA 
AGTGATACAG AATTAGAAAT CACTTTAGAA 
GCTTTCCCAT CATTCTTCCC GCAACGTCAA 
GCATCAAACA GCGAAAGTGC TGTCTACAAT 
CCTGGTACAG ATACAAAATG GTCATTCAAG 
GTGAAACTGG ACTCAGTAGA TGTGAATGTC 
TTCCAAGATG GACAAACAGA CGATGTCGTT 
AATGACCCAG CTTTTGTTAG TCAAAAAGAA 
CGTGATGAAA AATCACCATT TAGAAATGCG 
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AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG GGTTAGTTGA ATCAATCCTT 
AGGGGATGG 



EF041-2 (SEQ ID NO: 158) 

M KLKKSLTFGV ITLFSVTTLA ACGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNW KESPTALNLF QDGQTDDWL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 



EF041-3 (SEQ ID NO:159) 

TTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 
AGTGGCGAAC AAGTTTTACG TGTCACAGAA CAACAAGAAA TGCCAACAGC TGATTTATCA 
CTAGCAACAG NCAGAATTAG TTTTATTGCA TTAAATAATG TATATGAAGG AATTTATCGT 
TTAGACAAAG ATAACAAAGT CCAACCTGCA GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA 
GATGGACTAA CATACAAAAT TAAATTAAAT AAAGATGCAA AATGGTCAGA CGGTAAACCA 
GTGACTGCTA ATGACTATGT TTACGGATGG CAACGAACAG TTGATCCAGC GACAGCTTCT 
GAATATGCTT ATCTGTATGC CTCTGTAAAA AATGGTGATG CCATTGCTAA AGGGGAAAAA 
GATAAATCAG AATTAGGAAT TAAAGCAGTC AGTGATACAG AATTAGAAAT CACTTTAGAA 
AAAGCAACAC CATACTTTGA TTACTTATTA GCTTTCCCAT CATTCTTCCC GCAACGTCAA 
GACATTGTGG AAAAATATGG TAAAAATTAT GCATCAAACA GCGAAAGTGC TGTCTACAAT 
GGTCCATTCG TCTTAGACGG CTTTGATGGT CCTGGTACAG ATACAAAATG GTCATTCAAG 
AAAAACGATC AATATTGGGA TAAAGATACT GTGAAACTGG ACTCAGTAGA TGTGAATGTC 
GTGAAAGAAT CACCAACCGC GTTGAACTTG TTCCAAGATG GACAAACAGA CGATGTCGTT 
CTTTCTGGTG AATTAGCCCA ACAAATGGCC AATGACCCAG CTTTTGTTAG TCAAAAAGAA 
GCATCAACAC AATATATGGA ACTAAATCAA CGTGATGAAA AATCACCATT TAGAAATGCG 
AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT 
AGGGGATGG 

EF041-4 (SEQ ID NO:160) 
CGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNW KESPTALNLF QDGQTDDWL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN . LRKAISYSID RKALVESILR 
GW 

EF044-1 (SEQ ID NO:161) 

TAAGATAAAA TTAGTTATAG CGTCTATAGG AGGAATAGTA TGAAAAAATT AGTTTGTGTT 
ATTTTAGTTA TTTTTTTAAC AGGTTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 
GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
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GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 
GTAATTCAAT AA 

EF044-2 (SEQ ID NO: 162) 

MKKLVCVI LVIFLTGCSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI 

ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG 

MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS 

IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 

IQ 

EF044-3 (SEQ ID NO: 163) 

TTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 

GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 
GTAATTCAA 



EF044-4 (SEQ ID NO:164) 

CSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI 

ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG 
MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS 
IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 
IQ 

EF045-1 (SEQ ID NO: 165) 

TAGCCAAAAA ATGAGGGAGG AAAAGAGATG AACAAGAAAC GGATTTTAGG TGCAATCACG 
TTAGCTTCTG TGTTAGTATT CGGGTTAGCT GCATGTGGTG GCGGCAATAA AGGCGGGGGC 
AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 
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GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTGAGTCAG AAGAAAATAC TAAATTACTT GATGCGATTG ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAATAA 

EF045-2 (SEQ ID NO: 166) 

MN KKRILGAITL ASVLVFGLAA CGGGNKGGGN KATETEDISK MPIAVKNDKK 
AIDGGTLDVA WMDTQFQGL FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 
LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 
DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
RKNPVTIGPY YMSNIVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 
AMGYAIDNDA VGQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD 
VDGDGIREDK EGKPLEIKFA SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL PVNDRWDFT WAVDTKDNPW ATVGVTADSR 
K 

EF045-3 (SEQ ID NO: 167) 
ATGTGGTG GCGGCAATAA AGGCGGGGGC 

AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 
GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
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GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTGAGTCAG AAGAAAATAC TAAATTACTT GATGCGATTG ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAA 



EF045-4 (SEQ ID NO: 168) 
CGGGNKGGGN KATETEDISK MPIAVKNDKK 

AIDGGTLDVA WMDTQFQGL FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 
LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 
DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
RKNPVTIGPY YMSNIVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 
AMGYAIDNDA VGQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD 
VDGDGIREDK EGKPLEIKFA SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL PVNDRWDFT WAVDTKDNPW ATVGVTADSR 
K 



EF046-1 (SEQ ID NO:169) 

TAGGAGGATA TAATGAAAAA AAAACTTATT GTACTATTGT TAGCCTTATT TTTAACGGCA 
TGTAGTAATA ATACTGGGGG AAAAAATAGC GACGCTTCAT CTACTGAAGT ATCAACTAAG 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT AGTAGTAATC CGGACACAAC ACCAACTTCT 
ACATCATCTA TAACAATTGA AACAACCGAG AATTTAAAGA ATAGAGAATT GAATCCAACA 
GATGATGTTT CAAAAACTAG ACGACAATTG TATGAACAAG GAATTAACAG TTCAACAATT 
ACGGATAAAG AACTAAAGGA ATATATATCA GAGGCTAAAG AACAAAAGAA AGATGTCATT 
AATTATATTA AGCAAAAA 



EF046-2 (SEQ ID NO: 170} 

MKKKLIV LLLALFLTAC SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 
SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 



EF046-3 (SEQ ID NO: 171) 
A 

TGTAGTAATA ATACTGGGGG AAAAAATAGC 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT 
ACATCATCTA TAACAATTGA AACAACCGAG 
GATGATGTTT CAAAAACTAG ACGACAATTG 
ACGGATAAAG AACTAAAGGA ATATATATCA 
AATTATATTA AGCAAAAA 



GACGCTTCAT CTACTGAAGT ATCAACTAAG 
AGTAGTAATC CGGACACAAC ACCAACTTCT 
AATTTAAAGA ATAGAGAATT GAATCCAACA 
TATGAACAAG GAATTAACAG TTCAACAATT 
GAGGCTAAAG AACAAAAGAA AGATGTCATT 
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EF046-4 (SEQ ID NO: 172) 

C SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 

SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 

EF047-1 (SEQ ID NO:173) 

TAGGGAAAAC AAGGAGGAAT TCTTATGAAA AAGATAGGGC TTATTTCTAG TGCTTTTCTT 
TTAACCCTTG CTTTAGCAGC ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 
TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 

EF047-2 (SEQ ID NO: 174) 

MKK IGLISSAFLL TLALAACGGG KSTENTDSRS SAAESTTVES TKASATKESS 
SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR VNQLTPISSF 
EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLWRASN 
INGESPDDLA KNWNILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSWWQA GTWYSVHHF 
DPIQAVKMAT SM 

EF047-3 (SEQ ID NO:175) 

ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 

TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 



EF047-4 {SEQ ID NO: 176) 
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CGGG KSTENTDSRS SAAESTTVES TKASATKESS 

SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR VNQLTPISSF 
EKKTYGSDAE AKNAVNQI ID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLWRASN 
INGESPDDLA KNWNILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSWWQA GTWYSVHHF 
DPIQAVKMAT SM 



EF048-1 (SEQ ID NO: 177) 

TAAGGAGAAA AGTTCATGAA AAAAAGAAAG GTTTTATTTA CAGCAGTTAT GGTATTGGCA 
GGATTACAGT TGCTAAGTGG TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC 
TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-2 (SEQ ID NO: 178) 

MKKRKV LFTAVMVLAG LQLLSGCGKT EASANDTWL RYAYASNSQP VIDSMKKFGE 
LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 

EF048-3 (SEQ ID NO: 179) 

TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC 

TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA. ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
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AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-4 (SEQ ID NO:180) 

CGKT EASANDTWL RYAYASNSQP VIDSMKKFGE 

LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 



EF049-1 (SEQ ID NO: 181) 

TGAGACTCTT TCTTTTTCAA AATGAGGTAT GGTATAGTTA TAACAGANAT AAAACTANAA 
AAAACAGGAG TGCATAAGAG AATGAAGAAA AAACTAATCT TAGCTGCAGC GGGCGC AATG 
GCCGTTTTTA GTTTAGCAGC GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 
TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 

EF049-2 (SEQ ID NO: 182) 

MKKK LILAAAGAMA VFSLAACSSG SKDIATMKGS 

TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF EEKYGDKVTD KXIQKNFDEA KEQVEAQGGK 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK AHLKITDEDL KTAWASFHPE VEAQIIQVAS 
EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD SQATTVPAEV KEAAFKLKDG 
EVSEPIAATN MQTYQTTYYV VKMTKNKAKG NDMKPYEKEI KKIAEETKLA DQTFVSKVIS 
DELKAANVKI KDDAFKNALA GYMQTESSSA SSEKKESKSS DSKTSDTKTS DSEKATDSSS 
KTTESSSK 

EF049-3 (SEQ ID NO: 183) 

GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 

TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
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AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 

EF049-4 (SEQ ID NO: 184) 



CSSG SKDIATMKGS 
TITVDDFYNQ IKEQSTSQQA 
FSDALKQAGL TEKTFKKQLK 
EDDAKAVKKE ITDGGDFTKI 
EVSEPIAATN MQTYQTTYYV 
DELKAANVKI KDDAFKNALA 
KTTESSSK 



FSQMVIYKVF EEKYGDKVTD 
QRAAYDAGLK AHLKITDEDL 
AKEKSTDTAT KKDGGKIKFD 
VKMTKNKAKG NDMKPYEKEI 
GYMQTESSSA SSEKKESKSS 



KXIQKNFDEA KEQVEAQGGK 
KTAWASFHPE VEAQIIQVAS 
SQATTVPAEV KEAAFKLKDG 
KKIAEETKLA DQTFVSKVIS 
DSKTSDTKTS DSEKATDSSS 



EF050-1 {SEQ ID NO: 185) 

TAGGGTCTGG AAAAGCAGTC AACTGACTTC TTTTCCAAGC CCTTTTTTAG TTCATCGCAG 
AAAGGATGNA AAAAAATGAA CATGCCCAAA AATATCNGTT ATTTTTCTTT GCTAATGGGT 
CTTGTTCTAT TATTAAGTGC TTGCCAAATT GGGGCAACTA CGAAGGATGA CAACCAAGCC 
GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTCAT 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTGGT TAAAGAGCGA 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA AGATAATAAA 
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT 
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGGAACTA 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT 
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA 
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC CGAGTAAAAA ATAG 

EF050-2 (SEQ ID NO: 186) 

MNMPKN IXYFSLLMGL VLLLSACQIG ATTKDDNQAA 

TKEATVELNR TTTPTLFFHG YAGTKNSFGS LLHRLEKQGA TTQELVLLVK 
ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY QVNKANIVGH 
LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE NGPTEKSSRY 
PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNALLRQRGT QVTSQIIKGE 
PEVDQLLIEF LWPSKK 

EF050-3 (SEQ ID NO: 187) 



PDGTWKERG 
SMGGVSGLRY 
LDYQEMINW 
NAQHSQLHEN 
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TTGCCAAATT GGGGCAACTA CGAAGGATGA CAACCAAGCC 

GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTCAT 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTGGT TAAAGAGCGA 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA AGATAATAAA 
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT 
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGGAACTA 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT 
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA 
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC CGAGTAAAAA ATAG 



EF050-4 (SEQ ID NO: 188) 



CQIG ATTKDDNQAA 
TKEATVELNR TTTPTLFFHG 
ALSGKATNPS VQVLFEDNKN 
LGTYGQDTSL PKIEKFVSIG 
PEKLPILLIG GQLSPTDLSD 
PEVDQLLIEF LWPSKK 



YAGTKNSFGS LLHRLEKQGA 
NEWNQTEWIK NTLLYLQKNY 
APFNDFIDTS QQQTIETELE 
GTVPLSSALA VNALLRQRGT 



TTQELVLLVK PDGTWKERG 
QVNKANIVGH SMGGVSGLRY 
NGPTEKSSRY LDYQEMINW 
QVTSQIIKGE NAQHSQLHEN 



EF051-1 (SEQ ID NO: 189) 

TAAAAGAAAA GAGGCGTTCA AATGTCTAAA CAAAAAAAGG CTGTGTTCCT GCTTAGTTTA 
TTCAGTTTAG TTGCCCTAAT TGCTGCATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 
ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-2 (SEQ ID NO:190) 

MSKQ KKAVFLLSLF SLVALIAACT NQPQKETVST KKEEITLAAA ASLESVMEKK 
IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSW 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 

EF051-3 (SEQ ID NO:191) 

ATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 

ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
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AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-4 (SEQ ID NO: 192) 

CT NQPQKETVST KKEEITLAAA ASLESVMEKK 

IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSW 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 



EF052-1 (SEQ ID NO: 193) 

TAAAGTAGGA GAAGCGCAAG CGAAAAAAGT 
CCCACAATGG GTACCATGGG TAGCATTATC 
TTACTTAGTT CGTCGTGGAG AGAAGTGGAA 
NGAAATCTTC NGTTTTTATT ATTGTTGGTT 
GCAGAAAATA GGGAGACCAC AGAAGTCGGA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG 
GTTCGTAGCA GAACGCAAAT AAAAAGATTA 
CTAAGCTGGT TTGGCATATT GTTTTTAATA 
TTATGTAGAA AAGGAGAATA A 



GAATCAATCG GCAGCGTATC AAGTAGTGAT 
TTTGACAGTA GCACTTGCTG GATTGATTGC 
AAACGAAGGG GAAGTGACAT AATGAGANGA 
CTATTAATTT ATATTCCTCA AACAACTTAT 
ATCGGGTTTA CAAAAACTTC AGACATACCA 
CCGCAAACAA CCATTCAATC GCTATCAATC 
CCTAAAACTG GTGACAATCG AATAACTTGG 
AGTAGTTTTT GGCTGTTTCT ATTTAGACAA 



EF052-2 (SEQ ID NO: 194) 
MRXX 

NLXFLLLLVL LIYIPQTTYA ENRETTEVGI GFTKTSDIPS KKNPWNVLP QTTIQSLSIV 
RSRTQIKRLP KTGDNRITWL SWFGILFLIS SFWLFLFRQL CRKGE 

EF052-3 (SEQ ID NO:195) 



AGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGGTTTA CAAAAACTTC AGACATACCA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA CCATTCAATC GCTATCAATC 
GTTCGTAGCA GAACGCAAAT . AAAAAGAT 



EF052-4 (SEQ ID NO: 196) 

ENRETTEVGI GFTKTSDIPS KKNPWNVLP QTTIQSLSIV 
RSRTQIKR 



EF053-1 (SEQ ID NO: 197) 



TAGTCATGGC ACCATAACAA GGAGGAGAGA AGTGAGATGA AAAAATACCT TTTGCTTAGT 
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TGTTTTTTAG GTCTTTTCAG CTTCTGTCAT TCAGACACTG CGTTTGGAGA AGCAGCTTAT 
GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGAC AAG . GAAGATTACC AGCGACAGGT 
ACCACCAATC AAGCACCATT TATTTATTTG GGAATCAGCC TTATCACTAT AGGCATATTA 
TTTATTAAAA GGAGAAGAGA AGATGAAAAA AACAGTATTA GCAGTAGTAG GGATTGTAGG 
ATTTAG 

EF053-2 (SEQ ID NO: 198) 

MKKYLLLSC FLGLFSFCHS DTAFGEAAYE NSGWSFYGT YEYPTEESTT 

ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGRLPATGT TNQAPFIYLG ISLITIGILF 

IKRRREDEKN SISSSRDCRI 

EF053-3 (SEQ ID NO: 199) 

TTTGGAGA AGCAGCTTAT 

GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGA 

EF053-4 (SEQ ID NO:200) 

FGEAAYE NSGWSFYGT YEYPTEESTT 
ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGR 



EF054-1 (SEQ ID NO:201) 



TAAATAAAAA ATTATTTGGA GGAAATTACA 
AGTGCAGTAC TAGTATTCGG TGGCGGAAGT 
ACAGATCCAG CAACTCCACC AATTACCGAA 
ACTGAGCCGG TGGATCCTGC AGAACCGCCA 
AAGCCAACCG AGCCTACAAC ACCGAGTGAG 
ATTGATCCTG GAACGCCGGT TGAACCGACT 
CCAACCGAGC CTACAACACC AAGCGAACCA 
GAACCAACTC AACCAGTGAT TCCAGAAAAA 
GAACAGCCCA CTAAACCAAT AGACGTAGTC 
AATCAATCGG CAGGAACACA ACCAAGTATT 
ACACATGTAC CAAGTGAAAC TACTCCAATT 
GTAGATAAAG GTGTTCCGTT AACCAAAACA 
TATAAGGTTT TACCTAGCGG AAACGTTGAG 
TTGCCACATA CAGGAGAGAA ATTCACACTC 
TTAATTTCAG GATTCTTTTT CTTTAAAAAG 



ATGAAAAAAA TTATTTTATC AAGCTTGTTT 
ATAACAGCAT TCGCTGACGA TTTAGGACCA 
CCAACTGATT CTAGTGAACC TACGAATCCT 
GTAATACCAA CTGATCCAAC AGAACCAAGC 
CCAGAAAAGC CAACAGAACC AACAACGCCA 
GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
GAAAAACCTG TTACTCCAGA ACAACCGAAA 
CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GTTACACCTA GTGGAGAAAT TGATAAAACG 
CCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 
GTAAAAGCAA GTGATGGAAA AATGAAAGTA 
CTTTTCTCTG TATTGGGAAG CTTCTTTGTA 
AATAAGAAAA AAGCTTAA 



EF054-2 (SEQ ID NO:202) 



M KKIILSSLFS AVLVFGGGSI TAFADDLGPT DPATPPITEP TDSSEPTNPT 
EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVW TPSGEIDKTN 
QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKVL PHTGEKFTLL FSVLGSFFVL I SGFFFFKKN KKKA 



EF054-3 (SEQ ID NO:203) 
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A 

ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT CTAGTGAACC TACGAATCCT 
ACTGAGCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC 
AAGCCAACCG AGCCTACAAC ACCGAGTGAG CCAGAAAAGC CAACAGAACC AACAACGCCA 
ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAACCTG TTACTCCAGA ACAACCGAAA 
GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA GTGGAGAAAT TGATAAAACG 
AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 
TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA 
T 

EF054-4 (SEQ ID NO:204) 
DDLGPT DPATPPITEP TDSSEPTNPT 

EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVW TPSGEIDKTN 
QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKV 



EF055-1 (SEQ ID NO:205) 

TAACAAAAGG TTGTTTTGTC TTTCTTGTGT 
GGAGGTTTTT CAATGAAAAA AAAGCGTTAT 
AGTTTTTTTA TAAATGTTGA AGCGTCTGAT 
TACCAAAATC CGAGAACACC CGCTCCTAAA 
GCTGATCCCA AGGAACCAGC TGGTCCTCCG 
CAGACCACCA CAACTGGCTC AACGCTCCCT 
AGCATTCTCN GNTTCGCCTT AATCGGTTTG 
CATGAAGCAA ACTAA ' 

EF055-2 (SEQ ID NO:206) 



EF055-3 (SEQ ID NO:207) 

AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG 
TACCAAAATC CGAGAACACC CGCTCCTAAA 
GCTGATCCCA AGGAACCAGC TGGTCCTCCG 
CAGACCACCA CAACTGGCTC AACG 



EF055-4 (SEQ ID NO: 208) 
SDG GSSSVGIEFY 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ 



AAAAGGGCAA GAAAGGCTAG CGAGTTAAAA 
TTAATGATTG TGTGTCTACT ATCTTCTCCT 
GGTGGTTCTA GTTCGGTGGG GATTGAATTT 
GATCCCCCAC CGAAAACAGA TGCGCCAGCT 
CAAGGAGATC AACGAAGTGG TGGTTCGACA 
CGTACAGGGA GCAAGAGTCA GGCAAATTTG 
GCGGGAATCG TACATAGAAA GAAGGGACGA 



TTTTGSTLPR TGSKSQANLS 



GATTGAATTT 

GATCCCCCAC CGAAAACAGA TGCGCCAGCT 
CAAGGAGATC AACGAAGTGG TGGTTCGACA 



GDQRSGGSTQ TTTTGST 



MKKKRYL MIVCLLSSPS FFINVEASDG GSSSVGIEFY 
QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ 
ILXFALIGLA GIVHRKKGRH EAN 
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EF056-1 (SEQ ID NO:209) 

TAAATGAAAA AAAAGCGTTA TTTAATAATT GCGTGTTTAC TATTTTCCCC TAGTTTTTTT 
ATAAATGTTG AAGCATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 
CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAGCTCCC TCGTACAGGA AGCAAGAGTC AGGCAAACCT GAGCATTCTT 
GGTCTTGTCT TGATTGGTCT TGTCGGAATG GTCCAGAGAA AGAAGGGACG ACATGAAGCA 
AACTAA 



EF056-2 (SEQ ID NO:210) 

MKKKRYLIIA CLLFSPSFFI NVEASEGGSS 
EPAGPLQGDQ RSGGSTQTTT AGSQLPRTGS 

EF056-3 (SEQ ID NO:211) 



SVGIEFYQNP ATPAPKDAPP KTDEPAADPK 
KSQANLSILG LVLIGLVGMV QRKKGRHEAN 



ATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 

CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAG 

EF056-4 (SEQ ID NO:212) 

SEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK 
EPAGPLQGDQ RSGGSTQTTT AGSQ 

EF057-1 (SEQ ID NO:213)' 

TAATGTTTAT TGGCTGGGCC AGTCAATGTT GAAAATGGGG AAGGAGGAAT TCAGATGAAA 
ATCATAAAAA GGTTTAGTTT GGTATGTTTA GGGCTATTGA TCATTGGGTT GCNAACAAAA 
AGCGNTATGG CTGAAGAAAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 
TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT TACCTAAAAC AGGCGAGTCT 
GAAAATCCGC TGTATTCCTT GATAGGAGTT AGTTTGTTGG GGATAGTCAT TTATTTAATT 
AATAAAATGA AACGAGAGAA GGAGTTTATT TAA 

EF057-2 (SEQ ID NO:214) 

MKI IKRFSLVCLG LLIIGLXTKS XMAEENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AVLPKTGESE NPLYSLIGVS LLGIVIYLIN 
KMKREKEFI 

EF057-3 (SEQ ID NO:215) 

AAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 

TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT 

EF057-4 (SEQ ID NO:216) 

EENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AV 
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EF058-1 (SEQ ID NO:217) 

TGAAGAACGT TCTATTTGGT TGACGATTGC AGGCCTGCTA ATCATTGGGA TGGTAGTCAT 
TTGGCTATTT TATCAAAAAC AAAAAAGAGG AGAGAGAAAA TGAAGCAATT AAAAAAAGTT 
TGGTACACCG TTAGTACCTT GTTACTAATT TTGCCACTTT TCACAAGTGT ATTAGGGACA 
ACAACTGCAT TTGCAGAAGA AAATGGGGAG AGCGCACAGC TCGTGATTCA CAAAAAGAAA 
ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGCTAAAC AAGCTGTCCA AAGTTTAACT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA 
GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC 
AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC 
TTTAAAAATG AGGCGAATGT TGATAACGGT CATACCGACG ACCAAACACC ACCAACTGTT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC TTCAACAGGT GGCAAAGGAA TCTACGTTTA CTTAGGAAGT 
GGCGCAGTCT TGCTACTTAT TGCAGGAGTC TACTTTGCTA GACGTAGAAA AGAAAATGCT 
TAA 

EF058-2 (SEQ ID NO:218) 
MKQLKKVW YTVSTLLLIL PLFTSVLGTT 

TAFAEENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS IYNVTNEFYE 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMWAF PVYEMIKQTD GSYKYGTEEL AWHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
KNEANVDNGH TDDQTPPTVE WTGGKRFIK VDGDVTATQA LAGASFWRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL EETVAPDDYV LLTNRIEFW 
NEQSYGTTEN LVSPEKVPNK HKGTLPSTGG KGIYVYLGSG AVLLLIAGVY FARRRKENA 



EF058-3 (SEQ ID NO:219) 
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AGAAGA AAATGGGGAG AGCGCACAGC TCGTGATTCA CAAAAAGAAA 

ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGCTAAAC AAGCTGTCCA AAGTTTAACT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA 
GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC 
AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC 
TTTAAAAATG AGGCGAATGT TGATAACGGT CATACCGACG ACCAAACACC ACCAACTGTT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC T 

EF058-4 (SEQ ID NO:220) 

EENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS IYNVTNEFYE 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMWAF PVYEMIKQTD GSYKYGTEEL AWHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
KNEANVDNGH TDDQTPPTVE WTGGKRFIK VDGDVTATQA LAGASFWRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL EETVAPDDYV LLTNRIEFW 
NEQSYGTTEN LVSPEKVPNK HKGT 



EF059-1 (SEQ ID NO:221) 

TAGATTGGAA GAATGAAAAT GAAAAAAATG ATTATTATTG CCTTATTCAG TACAAGCCTT 
TTAGCAGGGG GAAGCAGTGT TTCTGCTTAT GCGCAAGAAT CAGAAGGAAA TCTTGGTGAA 
ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA CTGACCCAAT AACGCCAAGT 
GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA 
ACCGAACCTA GTGAGCCTTC AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA 
CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
GTACCAGAGC AACCAACAGA GCCAAGTGTA CCAGAAAAAC CAGTAGAACC AAATAAACCA 
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ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTACTTCC TTACACTGGT 
GAAAAAATGG GCATAATTGG GTCAATCGCT GGTGTATGTT TGACTGTTTT ATCAGGAATC 
TTAATTTATA AAAAACGTAA AGTGTAG 

EF059-2 (SEQ ID NO:222) 

MKKMI IIALFSTSLL AGGSSVSAYA QESEGNLGET TGSVLPDEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTPSKPE QPTEPTTPSV 
PEQPTEPSVP EKPVEPNKPT EPEKPVPWP EKPWPQQPE QPTDVWKPN GEIATGESTQ 
QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP LTQTADGLKP IKSEYKVLPS 
GNVQVKSADG KMKVLPYTGE KMGIIGS IAG VCLTVLSGIL IYKKRKV 

EF059-3 (SEQ ID NO: 223) 

AGAAGGAAA TCTTGGTGAA 

ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA CTGACCCAAT AACGCCAAGT 
GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA 
ACCGAACCTA GTGAGCCTTC AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA 
CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
GTACCAGAGC AACCAACAGA GCCAAGTGTA CCAGAAAAAC CAGTAGAACC AAATAAACCA 
ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTAC 

EF059-4 (SEQ ID NO:224) 

EGNLGET TGSVLPDEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP 
PEQPTEPSVP EKPVEPNKPT EPEKPVPWP EKPWPQQPE 
QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP 
GNVQVKSADG KMKV 

EF060-1 (SEQ ID NO:225) 

TGAAAAATAG ACAAGGAGCA CGCGATGATG ACAATGAAAA GTAAAGGGTC ACTTCTGGTG 
ACGTTGGGAA TACTTTTAAC CGTTGGCATT GCGAGTCTAA TTGTTTCTTC TGAGAGTTTT 
GCAGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT TACCAAGGAC AGGTGAACGA AATAGCACGT GGCTTTACAG CCTTGGTATT 
GCCTGTTTAC TCGTAGTACT AACAAGTTTC TATTATTTGA ATAAAAAAAG GAAAAAGGAA 
AAATAA 

EF060-2 (SEQ ID NO: 226) 

MMT MKSKGSLLVT LGILLTVGIA SLIVSSESFA EEVGQTNIGV TFYGGKEPLK 



TEPTTPSKPE QPTEPTTPSV 
QPTDVWKPN GEIATGESTQ 
LTQTADGLKP IKSEYKVLPS 
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TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TSLPRTGERN STWLYSLGIA 
CLLWLTSFY YLNKKRKKEK 

EF060-3 (SEQ ID NO:227) 

AGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT 

EF060-4 (SEQ ID NO:228) 
EEVGQTNIGV TFYGGKEPLK 

TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TS 
EF061-1 (SEQ ID NO:229) 

TAATGGAACG ACCGACAGAA GAAGATTTTG AACTTACAAA TTAAAATTAA AATGGAGGAA 
ATAATGATGA AAAAAATTCT TTTTGCTAGT TTATTTAGTG CCACACTACT ATTTGGGGGA 
AGTGAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGC C AAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTAGA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TTTACCATAC 
ACAGGTGAAG AAATGAATAT CTTTTTATCT GCCGTAGCGG TATCTTGTCT GTAG 

EF061-2 (SEQ ID NO:230) 

MMKKILFASL FSATLLFGGS EISAFAQEII PDDTTTPPIE 
VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPTEPTTPT 
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEKPVQPAEP SKPIDWVTP 
TQQPTVPIET SNLAEITHVP SVTTPITTTD GENIVAVEKG VPLTQTAEGL 
PSGNVEVKGK DGKMKVLPYT GEEMNIFLSA VAVSCL 

EF061-3 (SEQ ID NO:231) 

GAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTACA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TT 



EPSEPEQPTE 
TGELNHAGNG 
KPIQSSYKVL 
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EF061-4 {SEQ ID NO: 232) 
QEII PDDTTTPPIE 

VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPTEPTTPT EPSEPEQPTE 
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEKPVQPAEP SKPIDVWTP TGELNHAGNG 
TQQPTVPIET SNLAEITHVP SVTTPITTTD GENIVAVEKG VPLTQTAEGL KPIQSSYKVL 
PSGNVEVKGK DGKMKV 

EF062-1 (SEQ ID NO:233) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGGGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC -GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
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TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGGCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGGGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF062-2 (SEQ ID NO:234) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTF1MN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT \ATTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNfFGQR IKTNTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF062-3 (SEQ ID NO:235) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
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GATAATGTAC AAGCCGCGGA ATTAGATACG 
AACCCCGACC TGCAGTCAGA AAAGGAAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT 
GGAGCTGAAA AATCAGCACA AGAACAACCA 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT 
AACATTACCG TTGTTGAAAA ACCAGCAGAA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN 
AAGAACGAAA ACAGCTATGT CAATGAAGCG 
GTCGTGACGA AAGACACTAA AATTTCGTCG 
GATTTTAATA AAGTAAATGC AGGGGATTCA 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA 
GGACTAAACG CTAGTTATTT AGGACGTAAA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG 
GATTTTGGGG CNAACAATGC GTTCAAATAC 
GATGGAAAAT TTTACTCACC GGAAGATATT 
AATAGTGATT GGGACGCTGT AGGTCACAAG 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN 
ATTTTCAATT ATGGGAATCC AAAAGAACCA 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN 
NTCAATGATT TAAATGTGAA NCGTGGCGAT 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA 
GATGCAGAAA AAGTGACGAT TGATTTATCC 
CTNAACGANA AAGACTNAAA AGCTGTTGCT 
GTGACTGCTT CTTATGANCT CAATTTAGAT 
AACGCNGACG GNTCNGTTGT TTTAGCAATG 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA 
GAAACGGTAA CAAATACAGT GATTAACCAT 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT 
CAAACAAAAA TTTATTATGA AGTGAAATCT 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG 
TGGCACGCTA TTACNAANTA TGACCTTAAA 
GATATTTCTG CCTACATTCT TTTAGAAAAC 
AATCAAGCAT TATTGGCNGC NTTAAATGAA 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT 
AAACCAACCA AAGCCGTTCA TAACAAGAAA 
CGTGGTGATG TTCTTTCTTA TGAAATGACN 
GCCTTTGATA CAGTCGATCT TGCGACAGGC 
AANGTGACAC CAATCAAAGA CTTACTTCGT 
AACCAGTTCA CGATCTCNTG GGACGATGCC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT 
CGAATTAAAA CCAATACNGT TGTCAACCAT 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA 
TTCTTCTATG AATTTACAAG- TAGTGACATT 
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CAACCAGAAA CAACGACGGT TCAACCCAAT 
CCTAAAACGG CAGTATCTGA AGAAGCAACA 
AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GATACCACAA ACGCGCAACA ACCAACAGTA 
GTAGTAAGCC CTGAAACAAC CAATGAACCT 
GAAAATGAAG TGAATAAATC AACGTCCATT 
AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
AAAATTGCCA AAGAAAATGC AGAAATTGGG 
GANAAAGAAG TCGCNGAATA CAACAAGCAT 
ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
AAAGATATCT TTACAAAATT ACGGAAAGAT 
AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
AAAAATAAAC CAGTGACAGT GACCTATACA 
ATTACAAAAG CAGAATTTGT TTATGAACTA 
AATGCAGTAT TTTCAAACGA TCCGATTATC 
GGTAAGGATG TTAAAACACG CTTAACGATT 
CTACCAGATA AAGATAGTCC ATTTGCGTAT 
AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
ATGACAACAA AAGGAAAAAG TAATGTGCCT 
ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
GAAAAAGCAA CGATTGAATT CAATNGATAC 
AATAAAGAAG TCACTGATGG NCAGAAAAAT 
TCTTTACAAT ACATTGTGAC AGGGGATACG 
GTAACNAAAC AAGGGATTCG AGATACNTTT 
AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
CAAAACACCG TCACAGCAAT GATGAAAACC 
GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
AATACAGCTG TTCAGCTGAC AAANGATGGN 
GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
TCCGAACGTC CAGCNAACTA TGGCGGAATN 
GACACGACCC ATGATCGTTT CACAGGNAAA 
GTAGGGGANA AAACGTTAAA AGCAGGAACA 
AAAGACAATA AAGACTTGAC GTTTACNATG 
GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
ACAGGTGACG TAGAAAACAC GCAAACAGAA 
ACNGTGGTGA CGCATACNCC TGATGATCCA 
GGGGAAGANA TTAANCATGG AAAAGTNGCT 
TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GTTTCTTTCT TCGATGATTA CGATGAAACG 
GTCAAAGATT CTAAAGGGGN AGACATTACG 
AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CAAGAATTGC GTGTAACNCT CCCTACAAAA 
AATTCAGCGG AACAAAATAC ATTTGGNCAA 
ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
AATGGNGCCA CAATCAAATT AGGGGAGAAN 
CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
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TGGTCGATTA GCGATAAACT AGACGTCAAA 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC 
TCGAAACTAT TCACGATGAC CTTTGAACAA 
TTTTTNGATG CGATGAATCT AAAAGAAAAC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG 
CCAGAAAAAA CAGTGATTGT ACCACCAACA 
TTAGTGGTAG AAAAGGCAAG TG 



CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
GGAACCAAAG TGAATAAAGG GGACGACATT 
GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
AAAAACGTTG CACACTCATG GAAAGCGTTC 
GTTTACAACA CAATCGAAGA ATCTTTCAAC 
ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAAAAACAC CGCAAGCACC AGTAGAGCCA 



EF062-4 (SEQ ID NO:236) 
AELDTQ PETTTVQPNN 

PDLQSEKETP KTAVSEEATV QKDTTSQPTK 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN 
GKFYSPEDID YGTG'PSGLKN SDWDAVGHKN 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD 
ISAYILLENK DNKDLTFTMN QALLAALNEG 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG 
GVERIAAGDV YNTIEESFNN EKIKTNTWT 
WEKASV 



VEEVAPENKG TEQSSATPND TTNAQQPTVG 
NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
WKITASQAF XDAMNLKENK NVAHSWKAFI 
HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 



EF063-1 {SEQ ID NO: 237) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA 
GATAATGTAC AAGCCGCGGA ATTAGATACG 
AACCCCGACC TGCAGTCAGA AAAGGAAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT 
GGAGCTGAAA AATCAGCACA AGAACAACCA 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT 
AACATTACCG TTGTTGAAAA ACCAGCAGAA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN 
AAGAACGAAA ACAGCTATGT CAATGAAGCG 
GTCGTGACGA AAGACACTAA AATTTCGTCG 
GATTTTAATA AAGTAAATGC AGGGGATTCA 



AAACAATATA AGACATATAA AGCTAAGAAT 
AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
CAACCAGAAA CAACGACGGT TCAACCCAAT 
CCTAAAACGG CAGTATCTGA AGAAGCAACA 
AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GATACCACAA ACGCGCAACA ACCAACAGTA 
GTAGTAAGCC CTGAAACAAC CAATGAACCT 
GAAAATGAAG TGAATAAATC AACGTCCATT 
AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GANAAAGAAG TCGCNGAATA CAACAAGCAT 
ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
AAAGATATCT TTACAAAATT ACGGAAAGAT 
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ATGGGNGGGA AAGNTACTGG CAACTTCCAG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA 
GGACTAAACG CTAGTTATTT AGGACGTAAA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG 
GATTTTGGGG CNAACAATGC GTTCAAATAC 
GATGGAAAAT TTTACTCACC GGAAGATATT 
AATAGTGATT GGGACGCTGT AGGTCACAAG 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN 
ATTTTCAATT ATGGGAATCC AAAAGAACCA 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN 
NTCAATGATT TAAATGTGAA NCGTGGCGAT 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA 
GATGCAGAAA AAGTGACGAT TGATTTATCC 
CTNAACGANA AAGACTNAAA AGCTGTTGCT 
GTGACTGCTT CTTATGANCT CAATTTAGAT 
AACGCNGACG GNTCNGTTGT TTTAGCAATG 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA 
GAAACGGTAA CAAATACAGT GATTAACCAT 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT 
CAAACAAAAA TTTATTATGA AGTGAAATCT 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG 
TGGCACGCTA TTACNAANTA TGACCTTAAA 
GATATTTCTG CCTACATTCT TTTAGAAAAC 
AATCAAGCAT TATTGGCNGC NTTAAATGAA 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT 
AAACCAACCA AAGCCGTTCA TAACAAGAAA 
CGTGGTGATG TTCTTTCTTA TGAAATGACN 
GCCTTTGATA CAGTCGATCT TGCGACAGGC 
AANGTGACAC CAATCAAAGA CTTACTTCGT 
AACCAGTTCA CGATCTCNTG GGACGATGCC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT 
CGAATTAAAA CCAATACNGT TGTCAACCAT 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA 
TTCTTCTATG AATTTACAAG TAGTGACATT 
TGGTCGATTA GCGATAAACT AGACGTCAAA 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC 
TCGAAACTAT TCACGATGAC CTTTGAACAA 
TTTTTNGATG CGATGAATCT AAAAGAAAAC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG 
CCAGAAAAAA CAGTGATTGT ACCACCAACA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC 
AAACGTAGAA AAGAAACAAA ATAA 
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AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
AAAAATAAAC CAGTGACAGT GACCTATACA 
ATTACAAAAG CAGAATTTGT TTATGAACTA 
AATGCAGTAT TTTCAAACGA TCCGATTATC 
GGTAAGGATG TTAAAACACG CTTAACGATT 
CTACCAGATA AAGATAGTCC ATTTGCGTAT 
AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
ATGACAACAA AAGGAAAAAG TAATGTGCCT 
ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
GAAAAAGCAA CGATTGAATT CAATNGATAC 
AATAAAGAAG TCACTGATGG NCAGAAAAAT 
TCTTTACAAT ACATTGTGAC AGGGGATACG 
GTAACNAAAC AAGGGATTCG AGATACNTTT 
AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
CAAAACACCG TCACAGCAAT GATGAAAACC 
GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
AATACAGCTG TTCAGCTGAC AAANGATGGN 
GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
TCCGAACGTC CAGCNAACTA TGGCGGAATN 
GACACGACCC ATGATCGTTT CACAGGNAAA 
GTAGGGGANA AAACGTTAAA AGCAGGAACA 
AAAGACAATA AAGACTTGAC GTTTACNATG 
GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
ACAGGTGACG TAGAAAACAC GCAAACAGAA 
ACNGTGGTGA CGCATACNCC TGATGATCCA 
GGGGAAGANA TTAANCATGG AAAAGTNGCT 
TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GTTTCTTTCT TCGATGATTA CGATGAAACG 
GTCAAAGATT CTAAAGGGGN AGACATTACG 
AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CAAGAATTGC GTGTAACNCT CCCTACAAAA 
AATTCAGCGG AACAAAATAC ATTTGGNCAA 
ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
AATGGNGCCA CAATCAAATT AGGGGAGAAN 
CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
GGAACCAAAG TGAATAAAGG GGACGACATT 
GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
AAAAACGTTG CACACTCATG GAAAGCGTTC 
GTTTACAACA CAATCGAAGA ATCTTTCAAC 
ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 



EP063-2 (SEQ ID NO: 238) 
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MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDYKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF063-3 (SEQ ID NO:239) 

GGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 

AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACX3TTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AAT*rCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCOTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
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AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTG 



EF063-4 (SEQ ID NO:240) 
ELDTQ PETTTVQPNN 

PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQ PTE VAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK V 

EF064-1 (SEQ ID NO:241) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGGTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
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GATGCAGAAA 
CTNAACGANA 
GTGACTGCTT 
AACGCNGACG 
GTAGTGAAAA 
GAAACGGTAA 
AAAGCAGATA 
CAAACAAAAA 
ACNGAAGAAT 
TGGCACGCTA 
GATATTTCTG 
AATCAAGCAT 
TCTGTGTATC 
AACTACAACA 
AAACCAACCA 
CGTGGTGATG 
GCCTTTGATA 
AANGTGACAC 
AACCAGTTCA 
CCACAAGCCT 
GTCAAAGCCG 
CGAATTAAAA 
GTTATTAAAG 
TTCTTCTATG 
TGGTCGATTA 
TTTGCCAATT 
TCGAAACTAT 
TTTTTNGATG 
ATTGGTGTAG 
AATGAGAAGA 
CCAGAAAAAA 
TTAGTGGTAG 
GTCTTATTAA 
AAACGTAGAA 



AAGTGACGAT 
AAGACTNAAA 
CTTATGANCT 
GNTCNGTTGT 
ATGTAGAAGG 
CAAATACAGT 
AAAACGGTAC 
TTTATTATGA 
GGGGCATGAA 
TTACNAANTA 
CCTACATTCT 
TATTGGCNGC 
TGGAAGTCGA 
AAGAGCTTGT 
AAGCCJTTCA 
TTCTTTCTTA 
CAGTCGATCT 
CAATCAAAGA 
CGATCTCNTG 
TTATTCTAGC 
ATGTTTCTGG 
CCAATACNGT 
TNGGTGACAA 
AATTTACAAG 
GCGATAAACT 
CTAATTTTGT 
TCACGATGAC 
CGATGAATCT 
AACGAATTGC 
TTAAAACNAA 
CAGTGATTGT 
AAAAGGCAAG 
CGGTAGCTGG 
AAGAAACAAA 



TGATTTATCC 
AGCTGTTGCT 
CAATTTAGAT 
TTTAGCAATG 
CGATTTTGAA 
GATTAACCAT 
AGTTGGCAGT 
AGTGAAATCT 
TGATGTCTTG 
TGACCTTAAA 
TTTAGAAAAC 
NTTAAATGAA 
ACGGATNAAA 
NCGTTCTAAT 
TAACAAGAAA 
TGAAATGACN 
TGCGACAGGC 
CTTACTTCGT 
GGACGATGCC 
GNATGGTGGG 
NGATGTTTAT 
TGTCAACCAT 
ACAAAGTCAA 
TAGTGACATT 
AGACGTCAAA 
TTTAGCAGAC 
CTTTGAACAA 
AAAAGAAAAC 
GGCAGGAGAC 
TACGGTAGTG 
ACCACCAACA 
TGTNGTGCCA 
TAGTTTAGCC 
ATAA 



AAAGTGAAAG 
GCAGCNATTA 
CAAAACACCG 
GGGTATAAAT 
AATACAGCTG 
GTGCCAGGTA 
GTTTCTCTAC 
TCCGAACGTC 
GACACGACCC 
GTAGGGGANA 
AAAGACAATA 
GGAAGCAATA 
ACAGGTGACG 
ACNGTGGTGA 
GGGGAAGANA 
TGGGACTTAA 
GTTTCTTTCT 
GTCAAAGATT 
AAAGGCACGG 
CAAGAATTGC 
AATTCAGCGG 
ATTCCAAAAG 
AATGGNGCCA 
CCTGCAGAAT 
CATGACAAAT 
GGAACCAAAG 
GGGGTAGTGA 
AAAAACGTTG 
GTTTACAACA 
ACNCATACGC 
CCAAAAACAC 
GAATTGCCGC 
GCAATGCTTG 



TTTATCAAGC 
ATTCAGGAAN 
TCACAGCAAT 
ATTTACTTGT 
TTCAGCTGAC 
GTAATCCTTC 
ATGATAAAGA 
CAGCNAACTA 
ATGATCGTTT 
AAACGTTAAA 
AAGACTTGAC 
AAGTAGGCAA 
TAGAAAACAC 
CGCATACNCC 
TTAANCATGG 
AAGGGTACGA 
TCGATGATTA 
CTAAAGGGGN 
TGACNATNTC 
GTGTAACNCT 
AACAAAATAC 
TGAANCCTAA 
CAATCAAATT 
ACGCTGGNGT 
TTAGTGGCCA 
TGAATAAAGG 
AAATCACGGC 
CACACTCATG 
CAATCGAAGA 
CAGAAAAACC 
CGCAAGCACC 
AAACAGGCGA 
GCTTAGCAGG 



AGACGCAAGT 
AGCTAAAGAC 
GATGAAAACC 
CTTGCCGTTT 
AAANGATGGN 
CAAAGATGTA 
TATTCCGTTA 
TGGCGGAATN 
CACAGGNAAA 
AGCAGGAACA 
GTTTACNATG 
ACAAGCTTGG 
GCAAACAGAA 
TGATGATCCA 
AAAAGTNGCT 
TAAAGACTTT 
CGATGAAACG 
AGACATTACG 
TGCCAAAGAC 
CCCTACAAAA 
ATTTGGNCAA 
AAAAGACGTG 
AGGGGAGAAN 
TGTGGAAGAA 
ATGGTCTGTG 
GGACGACATT 
CAGTCAAGCC 
GAAAGCGTTC 
ATCTTTCAAC 
ACAAACNCCA 
AGTAGAGCCA 
AAAACAAAAT 
CTTAGGCTTT 



EF064-2 (SEQ ID NO:242) 

MKAKK' QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LiGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALIjAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
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FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 

EF064-3 (SEQ ID NO:243) 

AGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 

EF064-4 (SEQ ID NO: 244) 

VTIDLSK VKVYQADASL 

NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
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IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASV 



EF065-1 (SEQ ID NO:245) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGaCAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
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GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA . CAAAAAATAA AAAAAGAAAA AATTAG 

EF065-2 (SEQ ID NO:246) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKVJNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF065-3 <SEQ ID NO:247) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAAGGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
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CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CC AC TAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF065-4 {SEQ ID NO:248) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 

MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 

VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 

LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 

NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 

IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 

DINKVLDI ID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 

IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 

DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 

ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 

EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 

ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM. 

TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 

PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 

EF066-1 (SEQ ID NO:249) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 

ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 

GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 

AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 

GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 

TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 

GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 

ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 

AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 

GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 

ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 

AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 

TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
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GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGAGGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGAC A TTAATAAAGT GTTAGACATC 
AC TG ATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF066-2 (SEQ ID NO:250) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLWVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
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PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF066-3 (SEQ ID NO:251) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCA 

EF066-4 (SEQ ID. NO: 252) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
I PKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVT 

EF067-1 (SEQ ID NO:253) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
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AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF067-2 (SEQ ID NO:254) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
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ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF067-3 (SEQ ID NO:255) 

GCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 

GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF067-4 (SEQ ID NO:256) 

VLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 



EF068-1 (SEQ ID NO: 257) 

TAGGGGAAGC TAATGATCTT GGTATTTATC 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT 
CGCAAGATGC TTAATTTAGC AATATCAAGT 
GTAAGTATAG CTGTTACCTC TGGCACAATC 
CTATTATCAA ATGTTACGTC AAATAATGAC 
GCCGCAAACC AAAATCAACC AGTTAATTTC 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC 



GTTTATTTTA AAGAAAAGAG GGACGATCAG 
CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
GGTTTATTAT TTACGTCATT AGCAATCCCT 
AGTGCATCAG CAGCGGTCTT GGATATCGAA 
AGTGGCACTT CAACGAGTAA TCGTTGGACA 
ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATACCAATG TCACGATTGA TCTTTCAAAA 
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GTTACT TTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGC AGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
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ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA. ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT GGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 



EF068-2 (SEQ ID NO:258) 



M KKKIVEDFNR KSQHKKWTKR KMLNLAISf 
SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV 
FTVTVPAGEA GANETLTAVA KNASGTEXTP 
KGTADANATV EIRNAGGTVI GTGTADGTGA 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
FTVTIPAGEA GANETLTAVA KNASGTESTP 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG 
PATATTPADV TAPTVDNITG NSGSGYEITG 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE 
ADPNATIEIR DADGNVIATG TADGTGSFAV 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA 
LASGKATAKQ TVNWAKNDT GLESQPTTAM 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV 
TATANEALTA IAKDAAGKES NPTAFKTPAD 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI 
TPGETITIIS KDGAGNESQP ATAVIPADW 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP 
TAGQATAQQS LLATATDGAG HYSTATTFMT 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS 
YLPSTGEKES SAVTTSLFGA FVALLASMGI 



;g llftslaipv 

gtstsnrwta anqnqpvnft vsggaladas 
tnvtidlskv tfltavlnaa ndltnvitqi 
lgaasftape tlaadgsyis apisdglglv 
vaaainaall pvkgtvnvav sgalpllavg 
snnldarfvg twqtdlldv nllatadgvs 
kgtadanatv eirnaggtvi gtgtadgtga 
ttfqtpadea tvtaptitgv tgnstagyev 
ftvtvpagea ganetltava knasgtestp 
kgtadanatv eirnaggavi gtgtadgtga 
ttfqtpadpn tpvatpivet vtgsttkgye 
kytvtldsgt atanqtlsw aknasgtesq 
tadpnttiev rdpsgavigt gtsdangdft 
vlvpadatvt aptvtgvtgn svagyqvtgt 

NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
TPADVTTPTI GDITGDSTTG YEITGTADPN 
GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
PDAPVATPTV DKITGSTTNG YQWGAAEVG 
GKASANETIT WAKNATGKE SQPATATTPV 
DVRDADGTII AATTANETGQ YTVTLPAGW 
LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
EKETLTALTT DTQGNVSPKT TFMTPADITG 
NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
IKRKRKN 
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EF068-3 (SEQ ID NO : 259 ) 

CTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 

CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATG AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GG AAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC 



EF068-4 (SEQ ID NO:260) 

TSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTP 

EF069-1 (SEQ ID NO:261) 

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT 
GTAAGTATAG CTGTTACCTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 
CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
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ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA ' 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
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GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TAOGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA' 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF069-2 (SEQ ID NO:262) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTI I AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 



EF069-3 (SEQ ID NO:263) 
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AGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 

GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAA 



EF069-4 (SEQ ID NO:264) 



AGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TI IGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGT 



EF070-1 (SEQ ID NO:265) 



TAGGGGAAGC TAATGATCTT GGTATTTATC 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT 
CGCAAGATGC TTAATTTAGC AATATCAAGT 
GTAAGTATAG CTGTTACCTC TGGCACAATC 
CTATTATCAA ATGTTACGTC AAATAATGAC 
GCCGCAAACC AAAATCAACC AGTTAATTTC 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC 
GTTACTTTTT TGACTGCCGT TTTGAATGCA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT 



GTTTATTTTA AAGAAAAGAG GGACGATCAG 
CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
GGTTTATTAT TTACGTCATT AGCAATCCCT 
AGTGCATCAG CAGCGGTCTT GGATATCGAA 
AGTGGCACTT CAACGAGTAA TCGTTGGACA 
ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATACCAATG TCACGATTGA TCTTTCAAAA 
GCCAATGATT TAACCAATGT GATTACTCAA 
GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
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CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGG AC TGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
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ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA. CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF070-2 (SEQ ID NO:266) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS. ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGS.TTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 

EF070-3 (SEQ ID NO: 267) 

CGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
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GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACT 



EF70-4 (SEQ ID NO: 2 68) 
DGDGNE SQPTEVTVPE 

DATVAAPTVT TVTGTTATGY QVTGTAEPNV 
TATANEALTA IAKDAAGKES NPTAFKTPAD 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI 
TPGETITIIS KDGAGNESQP ATAVIPADW 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP 
TAGQATAQQS LLATATDGAG HYSTATTFMT 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS 
YL 



TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
PDAPVATPTV DKITGSTTNG YQWGAAEVG 
GKASANETIT WAKNATGKE SQPATATTPV 
DVRDADGTII AATTANETGQ YTVTLPAGW 
LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
EKETLTALTT DTQGNVSPKT TFMTPADITG 
NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 



EF071-1 (SEQ ID NO:269) 

TAAGTAGAAG TGGTCGGGAC AAACGTAGAA 
GTCCCGCCAT TTATCTGCAG GTTTAAGCCG 
ATGGCTTTTT TAAGAAAGGA GCATGCTATG 
GTGATTGGTT TAAGTTTAAC GATTCCGATG 
CCAATCAACT TTACTTATTT TCCCGGCTCT 
TCTGGAAACG AGCGGAACCT AGGACCACAC 
CGAAATTGGT CAAATGCTTA TGTCTCATAT 



CTTTCGCTGA TTGCCGAAGA AATTACTTCT 
TGGAAGGGAA GTTATTTTGA CTTTCCTTTC 
TTTAAAAAAT TAATGATTCA ACTTGCTTTA 
ACGGCTTNCG CTTACACCAT CGAAGCGGAT 
GCAAGCAATG AATTAATTGT TTTACATGAA 
AGTTTAGACA ATGAAGTGGC CTATATGAAA 
TTTGTCGGAT CTGGTGGACG AGTGAAACAA 
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TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT 
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC 
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT 
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGGTA A 



EF071-2 (SEQ ID NO:270) 
MF KKLMIQLALV 

IGLSLTIPMT AXAYTIEADP INFTYFPGSA SNELIVLHES GNERNLGPHS LDNEVAYMKR 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA GSLANQKAYA QIELARTNNA ATFKKDYAAY 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWWGDHT DPYGYLARGG LVKRIGTRFT 
TGVSXTGETG HYSAR . 

EF071-3 (SEQ ID NO:271) 

G TTTAAAAAAT TAATGATTCA ACTTGCTTTA 

GTGATTGGTT TAAGTTTAAC GATTCCGATG ACGGCTTNCG CTTACACCAT CGAAGCGGAT 
CCAATCAACT TTACTTATTT TCCCGGCTCT GCAAGCAATG AATTAATTGT TTTACATGAA 
TCTGGAAACG AGCGGAACCT AGGACCACAC AGTTTAGACA ATGAAGTGGC CTATATGAAA 
CGAAATTGGT CAAATGCTTA TGTCTCATAT TTTGTCGGAT CTGGTGGACG AGTGAAACAA 
TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT 
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC 
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT 
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGGT 

EF071-4 (SEQ ID NO:272) 

F KKLMIQLALV 

IGLSLTIPMT AXAYTIEADP INFTYFPGSA SNELIVLHES GNERNLGPHS LDNEVAYMKR 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA GSLANQKAYA QIELARTNNA ATFKKDYAAY 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWWGDHT DPYGYLARGG LVKRIGTRFT 
TGVSXTGETG HYSAR 

EF072-1 (SEQ ID NO: 273) 

TAATCAATGA AAAACGCACG TTGGTTAAGT ATTTGCGTCA TGCTACTCGC TCTTTTCGGG 
TTTTCACAGC AAGCATTAGC AGAGGCATCG CAAGCAAGCG TTCAAGTTAC GTTGCACAAA 
TTATTGTTCC CTGATGGTCA ATT AC C AGAA CAGCAGCAAA ACACAGGGGA AGAGGGAACG 
CTGCTTCAAA ATTATCGGGG CTTAAATGAC GTCACTTATC AAGTCTATGA TGTGACGGAT 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG AGGCACAGCG TCAATTAGCA 
GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA AAACACAGAC AATAAATGGA 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT CGCAGCAACG AGATAAAGCC 
TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG AAAAAGCTAG CAACCTAGTA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA CGCATATTCA TTTATATCCA 
AAAAATGAAG AAAATGCCTA TGACTTACCA CCACTTGAAA AAACGGTACT CGATAAGCAA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA GATAAGGCGG ATACAACGTT GACACTTTTA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA CTACAGGTTA CACACTGACG 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG ACTTACAAAA CTTTGCAAAT 
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C AAAC AATGA CTGTGTCGTA TCAAATGCGT 
ATTAACAACG AAGGACAATT AGTCACGGAC 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC 
GAACGGACAA CGAACAGCAC CGTTACATGT 



TTAGAAAAGA CCGCTGAACC TGACACTGCG 
AAACATACCT TGACTAAAAG AGCCACAGTT 
GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GGGGAATACC TCAATGAAAC AGCAAACGGG 
AAAAAATTCA CGTCTAATCA AGCCGGTGAA 
GTACTTCTTG GAAGAAATCT CTGCACCAAA 
TTTTACGGTG GGAAAAAATT CTTATGCAAC 
AATCAATAA 



EF072-2 (SEQ ID NO: 274) 



MKNARWLSI CVMLLALFGF SQQALAEASQ ASVQVTLHKL LFPDGQLPEQ QQNTGEEGTL 
LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT VQEAQRQLAE TGATNRKPIA EDKTQTINGE 
DGWSFSLAS KDSQQRDKAY LFVEAEAPEV VKEKASNLW ILPVQDPQGQ SLTHIHLYPK 
NEENAYDLPP LEKTVLDKQQ GFNQGEHINY QLTTQIPANI LGYQEFRLSD KADTTLTLLP 
ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS IKDLQNFANQ TMTVSYQMRL EKTAEPDTAI 
NNEGQLVTDK HTLTKRATVR TGGKSFVKVD SENAKITLPE AVFIVKNQAG EYLNETANGY 
RWQKEKALAK KFTSNQAGEF SVKGXKRWPV LLGRNLCTKR LSSESNRNSF YGGKKFLCNE 
RTTNSTVTCN Q 

EF072-3 (SEQ ID NO:275) 

ATTACCAGAA CAGCAGCAAA ACACAGGGGA AGAGGGAACG 

CTGCTTCAAA ATTATCGGGG CTTAAATGAC GTCACTTATC AAGTCTATGA TGTGACGGAT 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG AGGCACAGCG TCAATTAGCA 
GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA AAACACAGAC AATAAATGGA 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT CGCAGCAACG AGATAAAGCC 
TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG AAAAAGCTAG CAACCTAGTA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA CGCATATTCA TTTATATCCA 
AAAAATGAAG AAAATCCCTA TGACTTACCA CCACTTGAAA AAACGGTACT CGATAAGCAA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA GATAAGGCGG ATACAACGTT GACACTTTTA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA CTACAGGTTA CACACTGACG 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG ACTTACAAAA CTTTGCAAAT 
C AAAC AATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA CCGCTGAACC TGACACTGCG 
ATTAACAACG AAGGACAATT AGTCACGGAC AAACATACCT TGACTAAAAG AGCCACAGTT 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG GGGGAATACC TCAATGAAAC AGCAAACGGG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA CGTCTAATCA AGCCGGTGAA 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA GTACTTCTTG GAAGAAATCT CTGCACCAAA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG GGAAAAAATT CTTATGCAAC 
GAACGGACAA CGAACAGCAC CGTTACATGT A 



EF072-4 (SEQ ID NO:276) 



QLPEQ QQNTGEEGTL 
LQNYRGLNDV TYQVYDVTDP 
DGWSFSLAS KDSQQRDKAY 
NEENAYDLPP LEKTVLDKQQ 
ESIEVKVAGK TVTTGYTLTT 
NNEGQLVTDK HTLTKRATVR 
RWQKEKALAK KFTSNQAGEF 
RTTNSTVTC 



FYQLRSEGKT VQEAQRQLAE 
LFVEAEAPEV VKEKASNLW 
GFNQGEHINY QLTTQIPANI 
QKHGFTLDFS IKDLQNFANQ 
TGGKSFVKVD SENAKITLPE 
SVKGXKRWPV LLGRNLCTKR 



TGATNRKPIA EDKTQTINGE 
ILPVQDPQGQ SLTHIHLYPK 
LGYQEFRLSD KADTTLTLLP 
TMTVSYQMRL EKTAEPDTAI 
AVFIVKNQAG EYLNETANGY 
LSSESNRNSF YGGKKFLCNE 
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EF073-1 (SEQ ID NO:277) 

TAAATGAACA AATTAAATAC AAAATTACTG ATTGGCTATA TTCTTTTAGG AGCCTTAATC 
ATTGCTGTCG CTAGAGAATA TGGCTTCTTC GCTTTTGTGA TTCTGGTAGG CTTTTTAGTA 
TTCGTTCTCT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT 
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA 
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG 
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CGATTTAGAT 
GTCGAAATGT CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT 
CAAGTAAACG AAGACCAGCA ATAA 



EF073-2 (SEQ ID NO:278) 

MNKLNTKLLI GYILLGALII AVAREYGFFA 
KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS 



FVILVGFLVF VLYRKKKNAA DKSDQMPYLT 
IIQLQENMNR STKLRAIDLR NDTTKVSKAL 
KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS 
QKAATEESPQ VNEDQQ 



EF073-3 (SEQ ID NO:279) 



CT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT 
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA 
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG 
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CGATTTAGAT 
GTCGAAATGT CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT 
CAAGTAAACG AAGACCAGCA AT 



EF073-4 (SEQ ID NO:280) 



YRKKKNAA DKSDQMPYLT 
KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS 

EF074-1 (SEQ ID NO: 281) 



IIQLQENMNR STKLRAIDLR NDTTKVSKAL 
KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS 
QKAATEESPQ VNEDQQ 



TAAAGGAGTT CTCAAAAAAT GAAGCTAAAA AAAATAATTC CTGCTTTTCC CCTTCTTTCA 
ACCGTTGCAG TTGGCTTGTG GTTAACGCCT ACTCAAGCTT CTGCAGATGC TGCGGATACG 
ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAACTG GGCCTCAAAA 
GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA 
GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA 
GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAACCA AACCGTCATC 
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CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 
ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 
TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 
GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCTATN ATATT 

EF074-2 (SEQ ID NO: 282) 

MKLKK IIPAFPLLST VAVGLWLTPT QASADAADTM VDISGKKVLV GYWHNWASKG 
RDGYKQGTSA SLNLSEVNQA YNWPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
GRAVLLALGG ADAHIQLVKG DEQAFANEII RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
WVDEVMTWVA QSNDALKYEF LYXI 

EF074-3 (SEQ ID NO:283) 

TGC TGCGGATACG 

ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAACTG GGCCTCAAAA 
GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA 
GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA 
GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAACCA AACCGTCATC 
CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 
ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 
TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 
GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCT 

EF074-4 (SEQ ID NO: 284) 
AADTM VDISGKKVLV GYWHNWASKG 
RDGYKQGTSA SLNLSEVNQA YNWPVSFMK 
GRAVLLALGG ADAHIQLVKG DEQAFANEII 
ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP 
WVDEVMTWVA QSNDALKYEF LY 



EF075-1 (SEQ ID NO:285) 

TAACCTATAA GAAAAAAATC ACAACCTGTG ATAAATTATT GGAGGNAAAA TATGTCAAAA 
GGGAAGAAAA TTTTTGCCAT TATCNTTGGA ATTATCTTGG NTCTATTTCT TGCAGTTGTT 
GGAATGGGAG CAAAACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA 
GTAGAACGAT CTAAAAAAAG TCAGGTCAAT TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
TTATTAGGGA TTGATACAGG CGATGATGGG CGTGTCGAGC AAGGTCGTTC GGATACAACA 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG CAAACAACCT TAGTCAGTCT TGCTCGCGAT 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA CAAGATAAAT TGAATCACGC CTATGCTTTT 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT GAAAACTATT TAAACATACC TATTAATCAT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA GAATTAGTCA ACGCGGTTGG CGGAATCGAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC GGATATGATT TTACGATTGG TAAAATTTCA 
TTGGATGGTG AACAAGCACT CTCCTATTCA AGAATGCGTT ACGAAGACCC TAATGGTGAC 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
CTTAACAGCG TAAGCAACTA TCAAGAAATT TTAACAGCTG TTTCTGATAA TATGAAGACA 
GATTTAAGTT TTGATGACAT GAAAAAAATT GCCTTAGATT ATCGCAGTGC CTTTGGTAAA 



SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
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GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTTC CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 
TAA 

EF075-2 (SEQ ID NO:286) 

MSKG KKIFAIIXGI ILXLFLAWG MGAKLYWDVS KSMDKTYETV 

ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPRDKQ TTLVSLARDT 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NYLNIPINHY VSINMAGLKE LVNAVGGIEV 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI EGIVQKVLSL 
NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA LDYRSAFGKV KQDQLQGTGF MQDGVSYQRV 
DEQELTRVQQ ELKNQLNTK 



EF075-3 (SEQ ID NO:287) 

ACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA 

GTAGAACGAT CTAAAAAAAG TCAGGTCAAT TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
TTATTAGGGA TTGATACAGG CGATGATGGG CGTGTCGAGC AAGGTCGTTC GGATACAACA 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG CAAACAACCT TAGTCAGTCT TGCTCGCGAT 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA CAAGATAAAT TGAATCACGC CTATGCTTTT 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT GAAAACTATT TAAACATACC TATTAATCAT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA GAATTAGTCA ACGCGGTTGG CGGAATCGAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC GGATATGATT TTACGATTGG TAAAATTTCA 
TTGGATGGTG AACAAGCACT CTCCTATTCA AGAATGCGTT ACGAAGACCC TAATGGTGAC 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
CTTAACAGCG TAAGCAACTA TCAAGAAATT TTAACAGCTG TTTCTGATAA TATGAAGACA 
GATTTAAGTT TTGATGACAT GAAAAAAATT GCCTTAGATT ATCGCAGTGC CTTTGGTAAA 
GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTTC CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 



EF075-4 (SEQ ID NO:288) 
KLYWDVS KSMDKTYETV 

ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPRDKQ TTLVSLARDT 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NYLNIPINHY VSINMAGLKE LVNAVGGIEV 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI EGIVQKVLSL 
NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA LDYRSAFGKV KQDQLQGTGF MQDGVSYQRV 
DEQELTRVQQ ELKNQLNTK 



EF076-1 (SEQ ID NO: 289 ) 

TAGAAAATAA CAGAGGAGCT GAAGGAAATG 
AGCATTGCTG CAGTTGCAAG TGTCTCTGTT 
AAGGTATCTC ATGTTTCCAA TCGTTATAAA 
GGAAACCAAA AATTATTATC GATTGTCGAT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC 
GTTAAAGACA ATACAGATTC TTTAAAAGAA 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT 
TAA 



AAAGCATCAA CAAAAATTGG TATCGGTTTA 
GCAGTCATCG CTTCTGAAAA AATTATTAAG 
GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GATTTATCCG ATGATGAATT AGATTCTGTT 
GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
CGCTTTTTCA CATTTATTGA AGATGCAATG 
TTTTATAAAA ATAATTCTTT TGTTTCAACA 
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EF076-2 (SEQ ID NO:290) 



MK ASTKIGIGLS IAAVASVSVA VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 
NQKLLSIVDD LSDDELDSVL NWDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNSFVST 

EF076-3 (SEQ ID NO:291) 

CATCG CTTCTGAAAA AATTATTAAG 

AAGGTATCTC ATGTTTCCAA TCGTTATAAA GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GGAAACCAAA AATTATTATC GATTGTCGAT GATTTATCCG ATGATGAATT AGATTCTGTT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
GTTAAAGACA ATACAGATTC TTTAAAAGAA CGCTTTTTCA CATTTATTGA AGATGCAATG 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT TTTTATAAAA ATAATTCTT 

EF076-4 (SEQ ID NO:292) 

VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 

NQKLLSIVDD LSDDELDSVL NWDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNS 



EF077-1 (SEQ ID NO:293) 

TAATGTAAAG TGAATGATGG GAGAGAAAAA 
ACAATTATAA CAGGAGTTTT GGCATTATTA 
GCGTATGGCA TTATTTTAAT AACAGGTTCT 
ATTCAAACCT TACGTGAAGG AAAATATGGT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA 
GATAACTCGC CACAAAAAGC TCATCGCTTG 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC 
CCAATTGAAA AAAATCCTGG GGATGAATTA 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT 
ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA 
ACGATGGTCG AAAAATTAGC TTCTGCAAAA 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA 
ATTGTTGCTT ATGCCAGAAA GCAAGATGTC 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT 
TCACGTAATG GCACATATTT AGGCCGAATT 
AAAGAGACTA TGGAAAAATT ACACCAATTA 
GATCAAGAAT CCGTTGCAGA AACGATTGCT 
GAATGTTTAC CACAAGATAA ATTAACTATT 
GTCATCATGG TAGGAGATGG TGTAAATGAT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG 
GCCAAACAAT CTGTATTAAT CGGAATTTTT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT 



GAGATGAAGC ATGTAACAAA ATTGGGGATT 
TTTGAATTTA TTTTACATCA GCCGAATTGG 
GTAATGGCGT TAATGATGTT CTGGGAAATG 
GTCGATATTT TAGCGATTAC CGCTATCGTT 
AGTTTGATGA TTTTAATTAT GTTGACTGGT 
AAAGCTAACC AAGAGCTGAA GTCATTATTG 
AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GTAAAACCAG GGGAACTAGT TCCAGTTGAT 
GATGAATCTT CATTAACAGG AGAATCAAAA 
ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
GACAGTCAAT ATCAAACAAT TGTGAACTTA 
TTTGTACGTT TAGCAGATCG CTATGCGGTA 
GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
AGTCGTCATG GGGTCGTTAT TAAATCGGGA 
ACGATTGCGT TTGATAAAAC AGGCACGATT 
CAACCAATCA ATGCTGGAAT AACTGCTGCT 
CAAGAATCAA GTCATATTTT AGC TAGATC A 
CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GATGGTGCTG AGATACGGGT AGGTAAAAAG 
GAAAAAATTG ATAAAACGAC TATTCATATT 
ACTTTTACAG ACACTGTACG CCCAGAAGCA 
CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GCAGAAGTAG GAATTACCGA AGTACATGGG 
CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GCACCTTCGC TTGCTGCTGC AGACGTAGGT 
GCTAGTGAAA CTGCTGACGT TGTTATTTTA 
GTCGAAATTG CCCAAGATAC CATGAAAATT 
ATCTGCGTTT TACTAATGTT AATTGCTAGT 
ATGCTACAAG AAGTCGTGGA CACTGTGTCA 
GGCCAGTAA 
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EF077-2 (SEQ ID NO: 294) 

MKHVTKLGIT I ITGVLALLF EFILHQPNWA YGIILITGSV MALMMFWEMI 
QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELW KPGELVPVDG LVKTGTSTVD ESSLTGESKP 
IEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP 
FTLVAYLIAG VAWFVSKSPT RFAEVLWAS PCPLILSAPI ALVAGMGRSS RHGWIKSGT 
MVEKLASAKT IAFDKTGTIT QGQLSVDQVQ PINAGITAAE LVGLAASVEQ ESSHILARSI 
VAYARKQDVP LKNITDLAEV SGAGVKAFVD GAEIRVGKKN FVTQESQETE KIDKTTIHIS 
RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH LQRILMLTGD QESVAETIAA EVGITEVHGE 
CLPQDKLTIL KELPKENHPV IMVGDGVNDA PSLAAADVGI AMGAHGATAA SETADWILK 
DDLSKVSQAV EIAQDTMKIA KQSVLIGIFI CVLLMLIAST GIIPALIGAM LQEWDTVSI 
LSALRARRIG Q 

EF077-3 (SEQ ID NO:295) 

TCA GCCGAATTGG 

GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAATGATGTT CTGGGAAATG 
ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAGCGATTAC CGCTATCGTT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTTAATTAT GTTGACTGGT 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA GTCATTATTG 
GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAACTAGT TCCAGTTGAT 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CATTAACAGG AGAATCAAAA 
CCAATTGAAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT TGTGAACTTA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGCAGATCG CTATGCGGTA 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTCGTTAT TAAATCGGGA 
ACGATGGTCG AAAAATTAGC TTCTGCAAAA ACGATTGCGT TTGATAAAAC AGGCACGATT 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTGGAAT AACTGCTGCT 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA GTCATATTTT AGCTAGATCA 
ATTGTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AGATACGGGT AGGTAAAAAG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC TATTCATATT 
TCACGTAATG GCACATATTT AGGCCGAATT ACTTTTACAG ACACTGTACG CCCAGAAGCA 
AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA AGTACATGGG 
GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GTCATCATGG TAGGAGATGG TGTAAATGAT GCACCTTCGC TTGCTGCTGC AGACGTAGGT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTGCTGACGT TGTTATTTTA 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAAGATAC CATGAAAATT 
GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAATGTT AATTGCTAGT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTCGTGGA CACTGTGTCA 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCC 

EF077-4 (SEQ ID NO:296) 

QPNWA YGIILITGSV MALMMFWEMI 

QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELW KPGELVPVDG LVKTGTSTVD ESSLTGESKP 
IEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP 
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FTLVAYLIAG VAWFVSKSPT RFAEVLWAS PCPLILSAPI ALVAGMGRSS RHGWIKSGT 
MVEKLASAKT IAFDKTGTIT QGQLSVDQVQ PINAGITAAE LVGLAASVEQ ESSHILARSI 
VAYARKQDVP LKNITDLAEV SGAGVKAFVD GAEIRVGKKN FVTQESQETE KIDKTTIHIS 
RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH LQRILMLTGD QESVAETIAA EVGITEVHGE 
CLPQDKLTIL KELPKENHPV IMVGDGVNDA PSLAAADVGI AMGAHGATAA SETADWILK 
DDLSKVSQAV EIAQDTMKIA KQSVLIGIFI CVLLMLIAST GIIPALIGAM LQEWDTVS I 
LSALRARRIG 

EF079-1 (SEQ ID NO:297) 

TAATTTCTAG CATCACCGAA GAAATTTTTA GAAAAACAAA GAGCCTGGGC CAATCACTGT 
CCCAGGCTCT CATGCTTTAT TTTTAAGGAG GAAGCAATGA AGTCAAAAAA GAAACGTCGT 
ATCATTGATG GTTTTATGAT TCTTTTACTG ATTATTGGAA TAGGTGCATT TGCGTATCCT 
TTTGTTAGCG ATGCATTAAA TAACTATCTG GATCAACAAA TTATCGCTCA TTATCAAGCA 
AAAGCAAGCC AAGAAAACAC CAAAGAAATG GCTGAACTTC AAGAAAAAAT GGAAAAGAAA 
AACCAAGAAT TAGCGAAAAA AGGCAGCAAT CCTGGATTAG ATCCTTTTTC TGAAACGCAA 
AAAACAACGA AAAAACCAGA CAAATCCTAT TTTGAAAGTC ATACGATTGG TGTTTTAACC 
ATTCCAAAAA TAAATGTCCG TTTACCAATT TTTGATAAAA CGAATGCATT GCTATTGGAA 
AAAGGAAGCT CCTTGTTAGA AGGAACCTCC TATCCTACAG GTGGTACGAA TACACATGCG 
GTCATTTCAG GCCATCGTGG TCTCCCTCAA GCCAAATTAT TTACAGATTT GCCAGAATTA 
AAAAAAGGCG ATGAATTTTA TATCGAAGTC AATGGGAAGA CGCTTGCTTA TCAAGTAGAT 
CAAATAAAAA CCGTTGAACC AACTGATACA AAAGATTTAC ACATTGAGTC TGGCCAAGAT 
CTCGTCACTT TATTAACTTG CACACCGTAT ATGATAAACA GTCATCGGTT ATTAGTTCGA 
GGACATCGTA TCCCATATCA ACCAGAAAAA GCAGCAGCGG GGATGAAAAA AGTGGCACAA 
CAACAAAATT TACTATTATG GACATTACTT TTAATTGCCT GTGCGTTAAT TATTAGCGGC 
TTCATTATCT GGTACAAGCG ACGGAAAAAG ACGACCAGAA AACCAAAGTA G 

EF079-2 (SEQ ID NO:298) 

MKSKKKRRI IDGFMIuLLI IGIGAFAYPF 

VSDALNNYLD QQIIAHYQAK ASQENTKEMA ELQEKMEKKN QELAKKGSNP 
TTKKPDKSYF ESHTIGVLTI PKINVRLPIF DKTNALLLEK GSSLLEGTSY 
ISGHRGLPQA KLFTDLPELK KGDEFYIEVN GKTLAYQVDQ IKTVEPTDTK 
VTLLTCTPYM INSHRLLVRG HRIPYQPEKA AAGMKKVAQQ QNLLLWTLLL 
IIWYKRRKKT TRKPK 

EF079-3 (SEQ ID NO:299) 

i 

TCCT 

TTTGTTAGCG ATGCATTAAA TAACTATCTG GATCAACAAA TTATCGCTCA TTATCAAGCA 
AAAGCAAGCC AAGAAAACAC CAAAGAAATG GCTGAACTTC AAGAAAAAAT GGAAAAGAAA 
AACCAAGAAT TAGCGAAAAA AGGCAGCAAT CCTGGATTAG ATCCTTTTTC TGAAACGCAA 
AAAACAACGA AAAAACCAGA CAAATCCTAT TTTGAAAGTC ATACGATTGG TGTTTTAACC 
ATTCCAAAAA TAAATGTCCG TTTACCAATT TTTGATAAAA CGAATGCATT GCTATTGGAA 
AAAGGAAGCT CCTTGTTAGA AGGAACCTCC TATCCTACAG GTGGTACGAA TACACATGCG 
GTCATTTCAG GCCATCGTGG TCTCCCTCAA GCCAAATTAT TTACAGATTT GCCAGAATTA 
AAAAAAGGCG ATGAATTTTA TATCGAAGTC AATGGGAAGA CGCTTGCTTA TCAAGTAGAT 
CAAATAAAAA CCGTTGAACC AACTGATACA AAAGATTTAC ACATTGAGTC TGGCCAAGAT 
CTCGTCACTT TATTAACTTG CACACCGTAT ATGATAAACA GTCATCGGTT ATTAGTTCGA 
GGACATCGTA TCCCATATCA ACCAGAAAAA GCAGCAGCGG GGATGAAAAA AGTGGCACAA 
CAACAAAATT TACTATTATG GACATTACTT TTAATTGCCT GTGCGTTAAT TATTAGCGGC 
TTCATTATCT GGTACAAGCG ACGGAAAAAG ACGACCAGAA AACCAA 



GLDPFSETQK 
PTGGTNTHAV 
DLHIESGQDL 
IACALIISGF 



EF079-4 (SEQ ID NO:300) 
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PF 

VSDALNNYLD QQIIAHYQAK ASQENTKEMA ELQEKMEKKN QELAKKGSNP GLDPFSETQK 
TTKKPDKSYF ESHTIGVLTI PKINVRLPIF DKTNALLLEK GSSLLEGTSY PTGGTNTHAV 
ISGHRGLPQA KLFTDLPELK KGDEFYIEVH GKTLAYQVDQ IKTVEPTDTK DLHIESGQDL 
VTLLTCTPYM INSHRLLVRG HRIPYQPEKA AAGMKKVAQQ QNLLLWTLLL IACALIISGF 
I IWYKRRKKT TRKP 

EF080-1 (SEQ ID NO:301) 

TAGTTACACT CGTTTAGGGC TAGCAACGTT AGGCATTTTC GCTGGACTCT TAGCACTCTT 
TTTATTAGGA GGTTATTTCC TATGAAAAAA CGACTTTTAC CTATTTTTTT CCTAATACTT 
CTTACCTTTG GCCTTGCCCT ACCCGTTTCG GCGGCTGAAA ATTCAATTGA TGATGGCGCA 
CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAAG AGATACAACC TTTAGAAGAA 
AAAACAAAAG CCTCTGTCTT TATTGTAACC ACAAATAATA ATACCTATGG CGATGAACAA 
GAATATGCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG ACCAAAATGC GATTCTTTTT 
CTCATTGATA TGGACTTACG GAAAATCTAC ATCTCTACTT CTGGAAACAT GATTGATTAT 
ATGACAGATG CACGAATTGA TGATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA 
AATTATTTCG CGGCTGCTCA AACCTTTGTT CAGGAAACTC AAGCATTTGT TAATAAAGGG 
GTTCCTGGGG GGCACTATCG TGTGGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC 
ATTACCCCGC TGGAAATGGT AATTGCTTTT GCTGCTGCGC TGATACTCAG TTTGGTCTTC 
TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GTTATCAATA TCCCTTTAGG 
GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT TAACCAACTC TTTCATCACT 
ACGCGTCGTA TTCCTAAAAA CAATGGCGGC AGTGGCGGAA TGGGCGGTGG TGGTAGCACC 
ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC GAAGTTTTTA G 

EF080-2 (SEQ ID NO: 302) 

MKKR LLPIFFLILL TFGLALPVSA AENSIDDGAQ „. tt „ t 
LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKDQNAILFL 
IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFVQ ETQAFVNKGV 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISKYQLK FSSYQYPFRE 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS GGMGGGGSTT HSTGGGTFGG GGRSF 

EF080-3 (SEQ ID NO: 303) 

GGCTGAAA ATTCAATTGA TGATGGCGCA „„„„„„» 

CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAAG AGATACAACC TTTAGAAGAA 
AAAACAAAAG CCTCTGTCTT TATTGTAACC ACAAATAATA ATACCTATGG CGATGAACAA 
GAATATGCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG ACCAAAATGC GATTCTTTTT 
CTCATTGATA TGGACTTACG GAAAATCTAC ATCTCTACTT CTGGAAACAT GATTGATTAT 
ATGACAGATG CACGAATTGA TGATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA 
AATTATTTCG CGGCTGCTCA AACCTTTGTT CAGGAAACTC AAGCATTTGT TAATAAAGGG 
GTTCCTGGGG GGCACTATCG TGTGGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC 
ATTACCCCGC TGGAAATGGT AATTGCTTTT GCTGCTGCGC TGATACTCAG TTTGGTCTTC 
TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GTTATCAATA TCCCTTTAGG 
GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT TAACCAACTC TTTCATCACT 
ACGCGTCGTA TTCCTAAAAA CAATGGCGGC AGTGGCGGAA TGGGCGGTGG TGGTAGCACC 
ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC GAAGT 

EF080-4 (SEQ ID NO:304) 



LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKDQNAILFL 
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IDMDLRKIYI STSGNMIDYM TDARI-DDTLD 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS 

EF081-1 (SEQ ID NO:305) 

TGAATGGAAC GAAGCAATCG TAATAAAAAA 
TCTGCCTTGG TTCTAATCGC TGCTGCCGGT 
GCCAAACAAG AATTAGCCGA AGCGAAGAAA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT 
TATGATACTA AATCTGTTGT TGAAAAATAC 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA 
TATAAATTAT CGATGAGCAC GCCTTTAGGC 
ATCGCCAAAA AAGGCGATAC CTACCAAATC 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA 
GATCGTAATG GTAGTGGGCT AGCAATTAAC 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG 
TTTGTACCAA TCACAGTCGC TTCTGAACCA 
AAAGATACAG AGTCACGTTA TTATCCGCTG 



KIWDNMSQGN YFAAAQTFVQ ETQAFVNKGV 
AALILSLVFL GINISKYQLK FSSYQYPFRE 
GGMGGGGSTT HSTGGGTFGG GGRS 



TCTTCAAAAA AACCACTTAT TCTTGGTGTT 
GGCGGGTATT ATGCTTATAG TCAATGGCAA 
ACAGCTACTA CATTTTTAAA CGTATTGTCA 
GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
AAGGCGAAAG ACAATCAATA CACATTTACC 
GAAATGAAAG ATTTGTCTTA TCAATCAAGT 
GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
GTAGATAATG CCAAACGTGG AGAAATTGTC 
AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 
GCCAATATCA AAGCTTTTAG TGATAAATTC 
TTAAGCCAAG GATGGGTCCA AGCAGACTCC 
GTGACAGAAT TACCAACAGG GGCTGCGACA 
GGGGAAGCAN TGCGCAATTA A 



EF081-2 (SEQ ID NO: 306) 

MERSNRNKKS SKKPLILGVS ALVLIAAAGG 
QEFDKLPSW QEASLKKNGY DTKSWEKYQ 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA 
RNGSGLAINK VFDEVGWPG KLGSGAEKTA 
VPITVASEPV TELPTGAATK DTESRYYPLG 



GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 
AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
WKPSLIFPDM SGNDKISIQV DNAKRGEIVD 
NIKAFSDKFG VSVDEINQKL SQGWVQADSF 
EAXRN 



EF081-3 (SEQ ID NO: 307) 



T GGCGGGTATT ATGCTTATAG TCAATGGCAA 

GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
TATGATACTA AATCTGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATCAATA CACATTTACC 
TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT 
ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC 
GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TGATAAATTC 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC 
TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG GGGG 

EF081-4 (SEQ ID NO:308> 

G GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 

QEFDKLPSW QEASLKKNGY DTKSWEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNDKISIQV DNAKRGEIVD 
RNGSGLAINK VFDEVGWPG KLGSGAEKTA NIKAFSDKFG VSVDEINQKL SQGWVQADSF 
VPITVASEPV TELPTGAATK DTESRYYPLG 
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EF082-1 (SEQ ID NO:309) 

TAAAAAATGA AAAAGATCGT GCGCATTTCA AGCATTTTGT TCGTTGCTAC GCCTCTTATG 
CTTTTAAATA GTTCAAAAGT TGAAGCAGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 
ATTACGTTTG CTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG 
CCTGTGACAC CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT 
GACTATGTTT CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT 
TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT 
GTTCAAGTAA CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT 
GCGCAATTTG CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT 
TTAGCAGCAA CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA 
ACCTTGGATC CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA 
ATGGGCACTT GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA 
ACTCTTCCTG CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG 
ATTTTGGATG ATACACCACT TTAA 

EF082-2 (SEQ ID NO:310) 

MKKIVRISS ILFVATPLML LNSSKVEAAQ VASIQSNADI TFALDNTVTP PVNPTNPSQP 
VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI 
LDDTPL 

EF082-3 (SEQ ID NO:311) 

AGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 

ATTACGTTTG CTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG 
CCTGTGACAC CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT 
GACTATGTTT CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT 
TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT 
GTTCAAGTAA CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT 
GCGCAATTTG CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT 
TTAGCAGCAA CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA 
ACCTTGGATC CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA 
ATGGGCACTT GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA 
ACTGTTCCTG CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG 
ATTTTGGATG ATACACCACT ' 

EF082-4 (SEQ ID NO:312) 

AQ VASIQSNADI TFALDNTVTP PVNPTNPSQP 

VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI 
LDDTP 

EF083-1 (SEQ ID NO:313) 

TAATTTAAAA GACAAGGAGA AATAAAAATG AAAAAGAAAA TTTTAGCAGG AGCGCTTGTC 
GCTCTGTTTT TTATGCCTAC AGCTATGTTT GCCGCAAAAG GAGACCAAGG TGTGGATTGG 
GCGATTTATC AAGGTGAACA AGGTCGCTTT GGCTATGCAC ATGATAAATT CGCTATTGCC 
CAGATTGGAG GCTACAATGC TAGCGGTATT TATGAACAAT ACACATATAA AACGCAAGTG 
GCAAGTGCTA TTGCCCAAGG TAAACGTGCG CATACCTATA TTTGGTATGA CACTTGGGGA 
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AACATGGACA TTGCGAAAAC AACAATGGAT TACTTTTTGC CACGTATTCA AACGCCTAAA 
AATTCCATCG TTGCATTAGA TTTTGAACAT GGAGCGTTGG CTAGTGTTCC AGATGGATAT 
SataS tIaStCAGA TGCCGAAAAA GCAGCAAATA CAGAGACAAT tttgtacggt 

atccgcagaa tcaaacaggc tggctatact ccaatgtatt acagctataa gccatttaca 
StcaS SaIctatca acaaatcatc aaagagtttc ctaactcttt atggattgct 
™ tcgatggtgt gtcaccatat ccattgtatg cttatttccc aagcatggat 
SISggS tttggcaatt cacatccgct tatattgcag gtggtttaga tggtaacgta 
SSSaSg gaattacgga tagtggttat acagatacca ataaaccaga aacggatacg 

CCAGCAACAG ATGCAGGCGA AGAAATTGAA AAAATACCTA ATTCTGATGT TAAAGTTGGC 

gSaccgtca AAGTGAAATT taatctagat gcttgggcaa ctggggaagc tattccgcaa 

TGGGTAAAAG GAAACAGCTA CAAAGTGCAA GAAGTAACTG GAAGCAGAGT ATTGCTTGAA 
CATGGATTAG CAAAGGTGAT ATTGAATTAT TGCCAGACGC AACAGTCGTC 
ccSataagc AACCAGAAGC GACTCATGTC GTACAATACG GAGAAACATT ATCAAGTATT 
gcSa^aat ATGGAACAGA CTATCAAACG TTGGCGGCAT TAAATGGATT GGCTAATCCA 
aatcStt ATCCTCGTCA AGTTTTGAAA GTCAATCGAT cggcaacaag taatgtctac 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AACTTGGCAC TACTTATCAA 

gIStIgI^ cIttaaacgg attagcaaat CCTAACTTGA tttatccagg TCAAACATTC 

AATTATTAA 

EF083-2 (SEQ ID NO:314) 

MK KKILAGALVA LFFMPTAMFA AKGDQGVDWA IYQGEQGRFG YAHDKFAIAQ 
^SSi^SyS^qva SAIAQGKRAH TYIWYDTWGN MDIAKTTMDY flpriqtpkn 
s^fSg aSsvpSyg gyvssdaeka antetilygm rrikqagytp myysykpftl 
mqq?S efpnslwiaa ypidgvspyp lyayfpsmdg igiwqptsay xaggldgnvd 

LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSDVKVGD TVKVKFNVDA WATGEAIPQW 
VKGNSYKVQE VTGSRVLLEG ILSWISKGDI ELLPDATWP DKQPEATHW QYGETLSSIA 
YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VKYGDNLSSI AAKLGTTYQA 
LAALNGLANP NLIYPGQTLN Y 

EF083-3 (SEQ ID NO: 315) 



AAAAG GAGACCAAGG TGTGGATTGG 
GCGATTTATC AAGGTGAACA AGGTCGCTTT 
CAGATTGGAG GCTACAATGC TAGCGGTATT 
GCAAGTGCTA TTGCCCAAGG TAAACGTGCG 
AACATGGACA TTGCGAAAAC AACAATGGAT 
AATTCCATCG TTGCATTAGA TTTTGAACAT 
GGAGGATATG TAAGTTCAGA TGCCGAAAAA 
ATGCGCAGAA TCAAACAGGC TGGCTATACT 
CTAAATCATG TAAACTATCA ACAAATCATC 
GCGTATCCTA TCGATGGTGT GTCACCATAT 
GGTATTGGTA TTTGGCAATT CACATCCGCT 
GATTTAACAG GAATTACGGA TAGTGGTTAT 
CCAGCAACAG ATGCAGGCGA AGAAATTGAA 
GATACCGTCA AAGTGAAATT TAATGTAGAT 
TGGGTAAAAG GAAACAGCTA CAAAGTGCAA 
GGTATCTTGT CATGGATTAG CAAAGGTGAT 
CCTGATAAGC AACCAGAAGC GACTCATGTG 
GCTTATCAAT ATGGAACAGA CTATCAAACG 
AATCTTATTT ATCCTGGTCA AGTTTTGAAA 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT 
GCTTTAGCTG CATTAAACGG ATTAGCAAAT 
AAT 



GGCTATGCAC ATGATAAATT CGCTATTGCC 
TATGAACAAT ACACATATAA AACGCAAGTG 
CATACCTATA TTTGGTATGA CACTTGGGGA 
TACTTTTTGC CACGTATTCA AACGCCTAAA 
GGAGCGTTGG CTAGTGTTCC AGATGGATAT 
GCAGCAAATA CAGAGACAAT TTTGTACGGT 
CCAATGTATT ACAGCTATAA GCCATTTACA 
AAAGAGTTTC CTAACTCTTT ATGGATTGCT 
CCATTGTATG CTTATTTCCC AAGCATGGAT 
TATATTGCAG GTGGTTTAGA TGGTAACGTA 
ACAGATACCA ATAAACCAGA AACGGATACG 
AAAATACCTA ATTCTGATGT TAAAGTTGGC 
GCTTGGGCAA CTGGGGAAGC TATTCCGCAA 
GAAGTAACTG GAAGCAGAGT ATTGCTTGAA 
ATTGAATTAT TGCCAGACGC AACAGTCGTC 
GTACAATACG GAGAAACATT ATCAAGTATT 
TTGGCGGCAT TAAATGGATT GGCTAATCCA 
GTCAATGGAT CGGCAACAAG TAATGTCTAC 
ATTGCAGCAA AACTTGGCAC TACTTATCAA 
CCTAACTTGA TTTATCCAGG TCAAACATTG 
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ef083-4 (seq id no:316) 
kgdogvdwa iyqgeqgrfg yahdkfaiaq 

Sasgiy eqytyktqva saiaqgkrah tyiwydtogn mdxakttmdy plpRIQTpkn 
S ehg a£asvpdgyg gyvssdaeka antetilygm rrikqagytp myysykpftl 
Snyqqiik efpnslwiaa ypidgvspyp lyayfpsmdg igiwqftsay iaggldgnvd 
Stdsgyt dtnkpetdtp atdageeiek ipnsdvkvgd tvkvkfnvda watgeaipqw 

VKGNSYKVQE VTGSRVLLEG ILSWISKGDI ELLPDATWP DKQPEATHW QVGETL^IA 

YQYGTDYQTL aalnglanpn liypgqvlkv ngsatsnvyt vkygdnlssi aaklgttyqa 

LAALNGLANP NLIYPGQTLN 
EF084-1 (SEQ ID NO: 317) 

TAGTCAAACG TTTATTTTTT CCTTAAATCC AGAAAAAATC CCGTAATTAT GGTACACTAC 
SaSS GGAGGAGAAC TATGAAGAAA TTTGATGTAA TTATTGTCGG tgctgggacg 
AGCGGTATGA TGGCCACGAT TGCGGCCGCC GAAGCAGGCG CTCAAGTATT ATTGATTGAA 
AAAAATCGCC GTGTTGGGAA AAAATTATTA ATGACTGGTG GCGGCCGCTG TAATGTAACC 
AATAATCGGC CCGCAGAAGA AATCATTTCA TTTATTCCTG GGAATGGAAA ATTTTTATAC 
AGCGCATTTT CACAATTTGA TAACTATGAT ATCATGAACT TTTTTGAATC CAATGGTATT 

cacSaaaag aagaagatca cggacgcatg ttccctgtta cagataaatc GAAGTCAATT 

GTTGATGCGC TATTTAACCG CATTAACGAA TTAGGAGTCA CTGTTTTTAC AAAAACACAG 
GTCACAAAAT TACTACGAAA AGACGATCAA ATAATTGGCG TTGAAACCGA ACTGGAAAAA 
ATTTATGCAC CGTGTGTTGT ATTAACAACT GGCGGCCGCA CTTATCCTTC CACAGGAGCA 

actggtgatg gctataaact agccaaaaaa atggggcata ccatcagccc ^tctaccct 

ACCGAATCAC CTATTATTTC TGAAGAACCT TTTATCCTGG ATAAAACGTT GCAAGGTCTC 

tcSScSg atgttaattt aactgttttg aaccaaaaag gaaaaccttt agttaatcat 
SIISgSa Sctgtttac acattttggc atttcaggac ctgccgcgct ccgctgttct 

AOTTTTATTA ACCAAGAATT AACTCGCAAC GGTAATCAAC CTGTCACGGT AGCCTTGGAT 

gtcScSa cSaa?ctS tgaagaagtg cctcccaaac aactaacaga AAAGCAACGN 
cSSSS tg^ctact gaaagacttt cagttcactg ttacgaaaac attccctttg 
gaUaatct? ttgtcacagg cggtgggatt tccctcaaag aagtgacccc taaaacaatc 

GaISSaT TAGTCAATGG TTTATTTTTT GCTGGTGAAC TTTTAGATAT TAATGGCTAT 

acSga^ IcStgttac agctccattt gtcactggac atgttgctgg ctcccatgcc 

GCAGAAATTG CAGAATACAC CTATTTACCA ATTGAAGAAG TCTAA 
EF084-2 (SEQ ID NO:318) 

MKKF DVIIVGAGTS GMMATIAAAE AGAQVLLIEK _ Tr , TOC , Mr , Tt , 
NRRVGKKLLM TGGGRCNVTN NRPAEEIISF IPGNGKFLYS AFSQFDNYDI MNFFESNGIH 
liKEEDHGRMF PVTDKSKSIV DALFNRINEL GVTVFTKTQV TKLLRKDDQI ^TELEKI 
YAPCWLTTG GRTYPSTGAT GDGYKLAKKM GHTISPLYPT ESPIISEEPF ILDKTLQGLS 
LQDVNLTVLN QKGKPLVNHQ MDMLFTHFGI SGPAALRCSS FINQELTRNG NQ™^ 
FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ FTVTKTLPLE KSFVTGGGIS LKEVTPKTME 
SKLVNGLFFA GELLDINGYT GGYNVTAAFV TGHVAGSHAA EIAEYTYLPI EEV 

EF084-3 (SEQ ID NO:319) 

C GAAGCAGGCG CTCAAGTATT ATTGATTGAA m .^ m ^^ 
AAAAATCGCC GTGTTGGGAA AAAATTATTA ATGACTGGTG GCGGCCGCTG TAATGTAACC 
AATAATCGGC CCGCAGAAGA AATCATTTCA TTTATTCCTG GGAATGGAAA ATTTTTATAC 
AGCGCATTTT CACAATTTGA TAACTATGAT ATCATGAACT TTTTTGAATC CAATGGTATT 
CACTTAAAAG AAGAAGATCA CGGACGCATG TTCCCTGTTA CAGATAAATC GAAGTCAATT 
GTTGATGCGC TATTTAACCG CATTAACGAA TTAGGAGTCA CTGTTTTTAC AAAAACACAG 
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GTCACAAAAT TACTACGAAA AGACGATCAA ATAATTGGCG TTGAAACCGA ACTGGAAAAA 
ATTTATGCAC CGTGTGTTGT ATTAACAACT GGCGGCCGCA CTTATCCTTC CACAGGAGCA 
ACTGGTGATG GCTATAAACT AGCCAAAAAA ATGGGGCATA CCATCAGCCC GCTCTACCCT 
ACCGAATCAC CTATTATTTC TGAAGAACCT TTTATCCTGG ATAAAACGTT GCAAGGTCTC 
TCTTTACAAG ATGTTAATTT AACTGTTTTG AACCAAAAAG GAAAACCTTT AGTTAATCAT 
CAAATGGATA TGCTGTTTAC ACATTTTGGC ATTTCAGGAC CTGCCGCGCT CCGCTGTTCT 
AGTTTTATTA ACCAAGAATT AACTCGCAAC GGTAATCAAC CTGTCACGGT AGCCTTGGAT 
GTGTTTCCGA CAAAATCTTT TGAAGAAGTG CCTGCCAAAC AACTAACAGA AAAGCAACGN 
CTTTCCTTTG TGGAACTACT GAAAGACTTT CAGTTCACTG TTACGAAAAC ATTGCCTTTG 
GAAAAATCTT TTGTCACAGG CGGTGGGATT TCCCTCAAAG AAGTGACCCC TAAAACAATG 
GAGAGCAAAT TAGTCAATGG TTTATTTTTT GCTGGTGAAC TTTTAGATAT TAATGGCTAT 
ACTGGAGGCT ACAATGTTAC AGCTGCATTT GTCACTGGAC ATGTTGCTGG CTCCCATGCC 
GCAGAAATTG CAGAATACAC CTATTTACCA ATTGAAGAAG TC 

EF084-4 (SEQ ID NO:320) 

E AGAQVLLIEK 

NRRVGKKLLM TGGGRCNVTN NRPAEEIISF IPGNGKFLYS AFSQFDNYDI MNFFESNGIH 

LKEEDHGRMF PVTDKSKSIV DALFNRINEL GVTVFTKTQV TKLLRKDDQI IGVETELEKI 

YAPCWLTTG GRTYPSTGAT GDGYKLAKKM GHTISPLYPT ESPIISEEPF ILDKTLQGLS 

LQDVNLTVLN QKGKPLVNHQ MDMLFTHFGI SGPAALRCSS FINQELTRNG NQPVTVALDV 

FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ FTVTKTLPLE KSFVTGGGIS LKEVTPKTME 

SKLVNGLFFA GELLDINGYT GGYNVTAAFV TGHVAGSHAA EIAEYTYLPI EEV 

EF085-1 (SEQ ID NO:321) 

TAACCCATGA AATCATTTTG TCCCGCATAT GGGGATATGA CTTTGACGGT GATGGCAGCA 
CAGTCCACAC TCATATCAAA AATCTGCGGG CGAACTGCCG GAAAATATCA TCAAAACCAT 
CCGCGGTGTA GGTTACCGAT TGGAGGAATC ATTATAATGG AAAGAAAAGG GATTTTCATT 
AAGGTTTTTT CCTATACGAT CATTGTCCTG TTACTGCTTG TCGGTGTAAC GGCAACACTG 
TTTGCACAGC AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC 
TATCAGCCAT TGGTGGAACT GATTCAGAAT AGCGATAGGC TTGATATGCA AGAGGTGGCA 
GGGCTGTTTC ACTACAATAA CCAATCCTTT GAGTTTTATA TTGAAGATAA AGAGGGAAGC 
GTACTCTATG CCACACCGAA TGCCGATACA TCAAATAGTG TTAGGCCCGA CTTTCTTTAT 
GTGGTACATA GAGATGATAA TATTTCGATT GTTGCTCAAA GCAAGGCAGG TGTGGGATTG 
CTTTATCAAG GGCTGACAAT TCGGGGAATT GTTATGATTG CGATAATGGT TGTATTCAGC 
CTTTf ATGCG CGTATATCTT TGCGCGGCAA ATGACAACGC CGATCAAAGC CTTAGCGGAC 
AGTGCGAATA AAATGGCAAA CCTGAAAGAA GTACCGCCGC CGCTGGAGCG AAAGGATGAG 
CTTGGCGCAC TGGCTCACGA CATGCATTCC ATGTATATCA GGCTGAAAGA AACCATCGCA 
AGGCTGGAGG ATGAAATCGC AAGGGAACAT GAGTTGGAGG AAACACAGCG ATATTTCTTT 
GCGGCAGCCT CTCATGAGTT AAAAACGCCC ATCGCGGCTG TAAGCGTTCT GTTGGAGGGA 
ATGCTTGAAA ATATCGGTGA CTACAAAGAC CATTCTAAGT ATCTGCGCGA ATGCATCAAA 
ATGATGGACA GGCAGGGCAA AACCATTTCC GAAATACTGG AGCTTGTCAG CCTGAACGAT 
GGGAGAATCG TACCCATAGC CGAACCGCTG GACATAGGGC GCACGGTTGC CGAGCTGCTA 
CCCGATTTTC AAACCTTGGC AGAGGCAAAC AACCAGCGGT TCGTCACAGA TATTCCAGCC 
GGACAAATTG TCCTGTCCGA TCCGAAGCTG ATCCAAAAGG CGCTATCCAA TGTCATATTG 
AATGCGGTTC AGAACACGCC CCAGGGAGGT GAGGTACGGA TATGGAGTGA GCCTGGGGCT 
GAAAAATACC GTCTTTCCGT TTTGAACATG GGCGTTCACA TTGATGATAC TGCACTTTCA 
AAGCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG 
CGGTTTGGGG CTTGCCATCG TACAAAAAAC GCTGGATGCC ATGAGCCTCC AATATGCGCT 
GGAAAACACC TCAGATGGCG TTTTGTTCTG GCTGGATTTA CCGCCCACAT CAACACTATA 
AATATTTAA 
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EF085-2 (SEQ ID NO:322) 

vXSl^L LLVGVTATLF AQQFVSYFRA MEAQQTVKSY QPLVELIQNS DRLDMQEVAG 
SKSSe FYIEDKEGSV LYATPNADTS NSVRPDFLYV VHRDDNISIV aqskagvgll 

miaimvvfsl lcayifarqm ttpikalads ANKMANLKEV PffiW . 
Sdmhsm yirlketiar LEDEIAREHE leetqryffa aashelktpi aavsvllegm 
?Sgdykdh skylrecikm mdrqgktise ilelvslndg rivpiaepld IGRTVAELLP 
dtoSSSn qrfvtdipag qivlsdpkli qkalsnviln avqntpqgge vriwsepgae 
kSSS^ vhiddtalsk lfipfyridq arsskkwakr fgachrtkna gcheppicag 
khlrwrfvla gftahintin I 

EF085-3 (SEQ ID NO: 323) 

AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC 

StSSSS^SLt gattcagaat agcgataggc ttgatatgca agaggtggca 
gStgtttc actacaataa ccaatccttt gagttttata ttgaagataa agagggaagc 
ScSSg ccIcaccgaa TGCCGATACA tcaaatagtg ttaggcccga ctttctttat 
gt^tacata gagatgataa tatttcgatt gttgctcaaa gcaaggcagg tgtgggattg 
t™l?cZ Sacaat tcggggaatt gttatgattg cgataatggt ^attcagc 
cStSc"g cgtatatctt tgcgcggcaa atgacaacgc cgatcaaagc cttagcggac 

AGTGCGAATA AAATGGCAAA CCTGAAAGAA GTACCGCCGC CGCTGGAGCG AAAGGATGAG 
TGGCTCACGA CATGCATTCC ATGTATATCA GGCTGAAAGA AACCATCGCA 

aSSSSg atcaaatcgc aagggaacat GAGTTGGAGG aaacacagcg atatttcttt 
SgSgcc? ctcatgagtt aaaaacgccc atcgcggctg taagcgttct gttggaggga 
aSSSS aSggtga ctacaaagac cattctaagt atctgcgcga atgcatcaaa 
^SSaca ggcagggcaa aaccatttcc gaaatactcg agcttgtcag cctgaacgat 
SSScg ££££££ cIaaccgctc gacatagggc gcacggttgc cgagctgcta 
ScgaS aaaccttggc agaggcaaac aaccagcggt tcgtcacaga tattccagcc 
SaIaSg tcctgtccga tccgaagctg atccaaaagg cgctatccaa tgtcatattg 

AATGCGGTTC AGAACACGCC CCAGGGAGGT GAGGTACGGA TATGGAGTCA GCCTGGGGCT 
SSSSS GTCTTTCCGT T^AACAT* GGCGTTCACA TTOATGATAC ^TTCA 
AArCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG 
cSSS SScCATCG TACAAAAAAC GCTGGATGCC ATGAGCCTCC AATATGCGCT 
SaSSS tcIgatggcg TTT^TTC^ GCTGGATTTA CCGCCCACAT CAACACTATA 
AATATTT 

EF085-4 (SEQ ID NO:324) 

S^f^^^ " AQSKAGVGIjL 

miaimvvfsl lcayifarqm ttpikalads ankmanlkev pp™™ 
SdLsm yirlketiar ledeiarehe leetqryffa aashelktpi ^vsvllegm 
lenigdykdh skylrecikm mdrqgktise ilelvslndg Rivpxaepld ^™eu£ 
dfotlaeann qrfvtdipag qivlsdpkli qkalsnviln avqntpqgge vriwsepgae 

KYRLSvS VhSdTALSK LFIPFYRIDQ ARSSKKWAKR FGACHRTKNA GCHEPPICAG 
KHLRWRFVLA GFTAHINTIN I 

EF086-1 (SEQ ID NO:325) 

^ rrpf ™ G GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT GATTTTATTA 
SSgZ ££££ AGATGCAGAA »»»»» 

r-TGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC GTGCAACAGG AGCTAACriA 
aSgSt^t CAATCACCGT TTTAGGTACT GGACTTTTGT TAGAAGATAA TCAACGCCTA 
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GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG TGATGGCTTA 
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT ACAATTGATG 
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AACCTTAATT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT 
AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG AATGAACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTC TAGTT AT CGCCAGTGGG 
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF086-2 (SEQ ID NO:326) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP IYRATGANLT 
DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE 
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG RSISRAPETN 
PFTTEFESGK ETIANLTLIA KFAPENLRND IYTSIQTWLQ QSGSYYHFFK KPRDFEALID 
LKNWNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW 
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 
GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI LDNRMIHPQE 
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
TNTFAKISKN YGKTVENGTY EYLTWGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV 
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF DKGILEWAA 
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE KDYTASSWKV 
YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
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EKDYTASSWK VYSEALKQAQ TVADQTTATO == = EQK 

NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIAbvl 
LLVFRKSKSK K 

EF086-3 (SEQ ID NO:327) 

■■■■■■ 



AAT 

EF086-4 (SEQ ID NO:328) 



EF087-1 (SEQ ID NO:329) 



====== 

tcgcmSS Sgacggcat gctttattta tacaatcaag actttgctca gtttgatgaa 

otnrrTi&AA CWATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 

GTTGCTAAAA AATCTTGGTT ATCGGATG AT TCATCCACAG 

ScGAA^aS TGGGAAAACG AATGAAGAAA TCGCAGCTCT ™AAAAAC 

SaSS ctcSSaga AAATACAGCA AACTTACAAG CCATTGAAGC aggtaattat 

======== 
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GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC C3GGGTTAAAT 
SSSSSc GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
SSSSS AGGAACAAAA SaACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 

« G Sgcattgaa acaagcacaa ac TCTC gcag atcaaacaac agcaacgcaa 
gSgaIgSg accaagcaga aacagagtta cgttcggcag tgaagcaatt ^taaaagtg 

CCaIcSg AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 

caa^aSaag actacaccgc aagcagttgg aaagtctaca gtgaagcatt gaagcaagcg 
SSc^g cagatcaaac aacagcaacg caagcagaag tagaccaagc agaagcaaaa 
™ cagtgaagcg attaacattg aaaaatagtg gggaaaataa aaaggagcaa 
SiSSS ggaSaatcg acacttaaat actagtacag gagttgatca aactcgtacg 
AAAAA ™° °™^Sg ccaaggtggt ttcagaaaag ctagccaatt tttaccgagc 
£aSag£a ag™? cgcTtStg a^gtc ttctagttat cgccagtggg 

TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 
EF087-2 (SEQ ID NO-.330) 

r VGIiANWFRA AIiTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP IYRATGANLT 

SSSSSS Slednorlv qvqeavpsvl ksvssgdgly PDGSLIQHGY FPYNGSYGNE 
JSSS^t SqgsdS Spnisnlfnv vdkgylqlmv NGKMPSMVSG RSISRAPETN 
p™Sgk eStlS kfapenlrnd iytsiqtolq qsgsyyhffk kprdfealid 

™S AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW 
u^^fv NODFAOFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 
S«L D Q KS« AKKSWFLLDG QII^SGIT GTTDASIETI = HPQE 

SS5S S5SS S5S2 SEE 55SE 35SS 

EES SSSSS SVISEKIDNG == OK = 

S5S2S SSSSS SSSS ESSE S = 

EkSSsS £SLcQAQ TVADQTTATQ AEVDQAEAKL, KBAVKRLTLK NSGENKKEQK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVI IGLLVIASGC 
LLVFRKSKSK K 

EF087-3 (SEQ ID NO:331) 

i^^llSS^ 0 ^^^ GATAATTCTT GGATTAGTTT AAGCGCAGCG 
aSS££ SaSS CTATGTTTTT CCTAATTCNA TGAATACGCT tgatgttcaa 
GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
ScISSa caSStaa AATTAGTAAA AATTATGGCA AGACTGTTGA aaatggtact 
ScGAA^S TAACAGTCGT TGGGAAAACG AATCAAGAAA TCGCAGCTCT rrCTAAAAAC 

ISSaS cS tTga aaatacagca aacttacaag ccattgaagc aggtaattat 
g^SISa aScatggaa taatgaccaa gaaattccag gactgtatgc gtatgatcca 
a™™ Scagaaaa aattgataac ggtotttatc gcttaactct tgcgaatcct 
ttacaaaata atgcatcc 

ef087-4 (seq id no: 332) 

^MnSnKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
S EYLTWGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV 
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRIiTLANPL QNNAS 

EF088-1 (SEQ ID NO:333) 
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TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT GATTTTATTA 
CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA AATTTACTGC TCCGCTGATG 
CTGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC GTGCAACAGG AGCTAACTTA 
ACAGATATTT CAATCACCGT TTTAGGTACT GGACTTTTGT TAGAAGATAA TCAACGCCTA 
GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG TGATGGCTTA 
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT ACAATTGATG 
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AACCTTAATT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT 
AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG AATGAACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTCT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT CGCCAGTGGG 
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF088-2 (SEQ ID NO:334) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP IYRATGANLT 
DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE 
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG RSISRAPETN 
PFTTEFESGK ETIANLTLIA KFAPENLRND IYTSIQTWLQ QSGSYYHFFK KPRDFEALID 
LKNWNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW 
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 
GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI LDNRMIHPQE 
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
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TNTFAKISKN YGKTVENGTY EYLTWGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV 
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF DKGILEWAA 
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE KDYTASSWKV 
YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK NSGENKKEQK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVI IGLLVIASGC 
LLVFRKSKSK K 

EF088-3 (SEQ ID NO:335) 

A ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 

AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAA 

EF088-4 (SEQ ID NO:336) 

T PEVTKEALEK LIQEQKEHQE KDYTASSWKV 

YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK NSGENKKEQK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKK 

EF089-1 (SEQ ID NO:337) 

TGACAGATAC ACCTGCTAAC ACAGGAAACT AAGAACGACA GCATACACGC AAGATCGGGA 
TATAGGTCAA AAATTTTTTG GCTTATCTTT CGGTCTTTTG GTGCTTATAA TACAACAAAG 
AATGACAGAC ATAGGAGAAT GAATATGAAC AGATGGAAAG TATATGCAAC GGTAATCGCT 
TGTATGTTAT TTGGCTGGAT TGGCGTGGAG GCGCACGCTT CTGAATTTAA TTTTGCGGTC 
ACACCAACAA TTCCCGAAAA TCAAGTGGAT AAATCAAAAA CCTACTTTGA CTTAAAAATG 
GCGCCTGGTG CCAAACAAAC CGTAGAAATT CAGTTACGCA ATGATACAGA TGAAGACATT 
ACCATTGAAA ATACGGTGAA CTCAGCGACA ACAAATTTAA ATGGCGTAGT AGAATATGGC 
CAAAACGGGA TCAAACCTGA CAAAACCTTA CGTTTTAACT TAAAAGATTA TGTGGAAGCA 
CCGAAAGAAA TCATCTTGCC GAAGCATTCC CAAAAGACCT TACCTTTAAC CATTACGATG 
CCTAAAGATT CTTTTGATGG CGTGATGGCT GGCGGTATAA CACTCAAAGA GAAAAAGAAA 
GAAACAACGA CTTCTGCGGA TCAATCAAAA GGGTTAGCTA TTAATAATGA ATACTCCTAT 
GTTGTGGCTA TTATTCTTCA GCAAAATGAG ACAAAGGTTC AACCAGATTT AAAATTACTG 
GGGGTTAAAC CAGGCCAAGT CAACGCGCGA AACGTCATCA ATGTTTCTTT ACAAAACCCA 
CAAGCGGCCT ATTTAAACCA ATTACATTTA ATCAACACTG TTTCAAAAGG AGGCGAAACG 
CTTTACCAAT CCGATACTGA GGATATGCAA GTGGCGCCAA ACTCTAACTT TAGTTACCCA 
ATTTCTTTAA AAGGGGAACG ATTAACGCCA GGAAAATATG TCTTGAAATC AACGGCCTAT 
GGTGTAAAAG ATGAAAAGGG CACCTATCAA GTCAAAGGCG CCAATGGTGA AGAACGGTAC 
CTGTACAAAT GGGAATTTAC AAAAGAATTT ACTATTTCTG GGGACGTCGC TAAAGAATTA 
AATGAAAAAG ACGTAACCAT TAAAGGAACC AATTGGTGGT TGTATCTACT GATTGCATTA 
ATCATTCTAG CGCTGCTCTT ATTGATTTTC TTCTTGTATC GTAAAAAGAA AAAAGAGGAA 
GAACAACAAT CTGAGCAATA A 



EF089-2 (SEQ ID NO:338) 
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MNR WKVYATVIAC 

MLFGWIGVEA HASEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT 
IENTVNSATT NLNGWEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTITMP 
KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAI ILQQNET KVQPDLKLLG 
VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI 
SLKGERLTPG KYVLKSTAYG VKDEKGTYQV KGANGEERYL YKWEFTKEFT ISGDVAKELN 
EKDVTIKGTN WWLYLLIALI ILALLLLIFF LYRKKKKEEE QQSEQ 

EF089-3 (SEQ ID NO:339) 

T CTGAATTTAA TTTTGCGGTC 

ACACCAACAA TTCCCGAAAA TCAAGTGGAT AAATCAAAAA CCTACTTTGA CTTAAAAATG 
GCGCCTGGTG CCAAACAAAC CGTAGAAATT CAGTTACGCA ATGATACAGA TGAAGACATT 
ACCATTGAAA ATACGGTGAA CTCAGCGACA ACAAATTTAA ATGGCGTAGT AGAATATGGC 
CAAAACGGGA TCAAACCTGA CAAAACCTTA CGTTTTAACT TAAAAGATTA TGTGGAAGCA 
CCGAAAGAAA TCATCTTGCC GAAGCATTCC CAAAAGACCT TACCTTTAAC CATTACGATG 
CCTAAAGATT CTTTTGATGG CGTGATGGCT GGCGGTATAA CACTCAAAGA GAAAAAGAAA 
GAAACAACGA CTTCTGCGGA TCAATCAAAA GGGTTAGCTA TTAATAATGA ATACTCCTAT 
GTTGTGGCTA TTATTCTTCA GCAAAATGAG ACAAAGGTTC AACCAGATTT AAAATTACTG 
GGGGTTAAAC CAGGCCAAGT CAACGCGCGA AACGTCATCA ATGTTTCTTT ACAAAACCCA 
CAAGCGGCCT ATTTAAACCA ATTACATTTA ATCAACACTG TTTCAAAAGG AGGCGAAACG 
CTTTACCAAT CCGATACTGA GGATATGCAA GTGGCGCCAA ACTCTAACTT TAGTTACCCA 
ATTTCTTTAA AAGGGGAACG AT 

EF089-4 (SEQ ID NO:340) 

SEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT 

IENTVNSATT NLNGWEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTITMP 

KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAI ILQQNET KVQPDLKLLG 

VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI 

SLKGER 

EF090-1 (SEQ ID NO:341) 

TAGTCTCTAA GAAATAAACC TAAAATTATT GATATAAAGG ATGAACAAAT GAAAAAAGAA 
GAAATGCAAA TGCGTAATAC ACGTCGTCAA AAATCAGGAA AAAATAATAA AAAGAAAGTA 
ATTATTACTT CTTTGGTTGG ACTAGCTCTG GTTGCTGGGG GCAGTTATGT TTATTTTCAA 
AGTCACTTTT TNCCAACCAC AAAAGTAAAT GGAGTTTCTG TAGGCTGGTT AAATGTAAAT 
GCTGCAGAAG AAAAATTAGC GCAAGTTAAT CAAACCGAAG AAGTTGTGGT TCAAACGGGG 
ACAAAAGAAG AAAAAATTCA ACTTCCTAAA AAATACCAAT TGGATCAAAA ATTTTTAAAA 
GACCATTTAC ACAGTAGCAA GGTGAAGCTA CCGTTAAACG AGGCATTCAA AAAAGAACTA 
GAAGCCAAAT TAGCAACTTT GAGTTTTCCA GAGGGGAAAC CAAGCAAAAA TGCGAGTATC 
CGTCGAGGCA ATGGCACTTT TGAAATTGTT CCCGAAGAAC AAGGCACAGT AGTGGACACA 
CAGCGCTTAA ACCAGCAGAT TATTGCGGAT GTTGAAGCGG GAAAAGGCAA CTATCAATAT 
AATGCCAAAG ATTTTTATAA AGCCCCTGAA ATTACAAAAG AGGATCAAAC GTTAAAGGCA 
ACATTGACAA CGCTCAATAA CAAGTTAAAT AAAACAATTA CAGTTGATAT TAATGGTGAA 
AAAGTAGCCT TTGATAAAAC ACAAATTCAA AACGTGCTGA ATGATGATGG CACAATCAAC 
AAAGAAAAAC TAACTACTTG GGTGACACAA TTAGAAACAA CATATGGTTC TGCTAATCAA 
CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTCGTT TTAAAAACAA CGGAAGTTAT 
GGCTGGTCGA TTGATGGGGC CAAAACGCAA GAACTACTAG TAAACGCGCT GAATAGCCAA 
GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTGGTGATA CCAAAGAAAA TAGTAAAATT 
GCCAATAATT ACATTGAAAT TGATTTAAAA GATCAAAAAA TGTATTGTTT CATTGATGGC 
AAAAAAATAG TCACCACAGA TGTCATTACT GGCAGATATA ACAAAGGAAC CGCAACAGTA 
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CCAGGATTCC ATACAATTTT ATATCGGACA ACCGATGTGA ATTTAGAAGG TCAAATGCTT 
GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGGATGC CGTTATTAAG TCAAGGGGGC 
GTTGTCACAC AAATCGGGAT TCATGACTCC GACCATAAAT TGGATAAGTA TGGCGATAAA 
GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGC TGTATCA ATACGCCAGG AACAGAAGTT 
TCAAAAATCT TTGATGTATC CTATGACGGA ATGCCGGTAA TTATTTATGG ACATATCTAT 
GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGT ATAA 



EF090-2 (SEQ ID NO:342) 

MRNTRRQK SGKNNKKKVI ITSLVGLALV AGGSYVYFQS 

HFXPTTKVNG VSVGWLNVNA AEEKLAQVNQ TEEVWQTGT KEEKIQLPKK YQLDQKFLKD 
HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTWDTQ 
RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK 
VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG 
WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK 
KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGGV 
VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 
DAPGEFDKPV DYGEEV 

EF090-3 (SEQ ID NO:343) 

CAC AAAAGTAAAT GGAGTTTCTG TAGGCTGGTT AAATGTAAAT 

GCTGCAGAAG AAAAATTAGC GCAAGTTAAT CAAACCGAAG AAGTTGTGGT TCAAACGGGG 
ACAAAAGAAG AAAAAATTCA ACTTCCTAAA AAATACCAAT TGGATCAAAA ATTTTTAAAA 
GACCATTTAC ACAGTAGCAA GGTGAAGCTA CCGTTAAACG AGGCATTCAA AAAAGAACTA 
GAAGCCAAAT TAGCAACTTT GAGTTTTCCA GAGGGGAAAC CAAGCAAAAA TGCGAGTATC 
CGTCGAGGCA ATGGCACTTT TGAAATTGTT CCCGAAGAAC AAGGCACAGT AGTGGACACA 
CAGCGCTTAA ACCAGCAGAT TATTGCGGAT GTTGAAGCGG GAAAAGGCAA CTATCAATAT 
AATGCCAAAG ATTTTTATAA AGCCCCTGAA ATTACAAAAG AGGATCAAAC GTTAAAGGCA 
ACATTGACAA CGCTCAATAA CAAGTTAAAT AAAACAATTA CAGTTGATAT TAATGGTGAA 
AAAGTAGCCT TTGATAAAAC ACAAATTCAA AACGTGCTGA ATGATGATGG CACAATCAAC 
AAAGAAAAAC TAACTACTTG GGTGACACAA TTAGAAACAA CATATGGTTC TGCTAATCAA 
CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTCGTT TTAAAAACAA CGGAAGTTAT 
GGCTGGTCGA TTGATGGGGC CAAAACGCAA GAACTACTAG TAAACGCGCT GAATAGCCAA 
GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTGGTGATA CCAAAGAAAA TAGTAAAATT 
GCCAATAATT ACATTGAAAT TGATTTAAAA GATCAAAAAA TGTATTGTTT CATTGATGGC 
AAAAAAATAG TC AC CAC AG A TGTCATTACT GGCAGATATA ACAAAGGAAC CGCAACAGTA 
CCAGGATTCC ATACAATTTT ATATCGGACA ACCGATGTGA ATTTAGAAGG TCAAATGCTT 
GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGGATGC CGTTATTAAG TCAAGGGGGC 
GTTGTCACAC AAATCGGGAT TCATGACTCC GACCATAAAT TGGATAAGTA TGGCGATAAA 
GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGCTGTATCA ATACGCCAGG AACAGAAGTT 
TCAAAAATCT TTGATGTATC CTATGACGGA ATGCCGGTAA TTATTTATGG ACATATCTAT 
GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGT AT 

EF090-4 (SEQ ID NO:344) 

TKVNG VSVGWLNVNA AEEKLAQVNQ TEEVWQTGT KEEKIQLPKK YQLDQKFLKD 
HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTWDTQ 
RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK 
VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG 
WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK 
KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGGV 
VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 
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DAPGEFDKPV DYGEEV 



EF091-1 (SEQ ID NO:345) 

TAATTGGNGG AGATTTTTAT GGCTAAAAAA 
GGAACAGCAG CAGCCGTTGC CGCATTATTA 
GATGATTTAT CAAATCAAAC AGATGATTTA 
GCTGTTCAAA AAGGAACAGA ATTAACAGAA 
GATCAAGCCT CTGATTTGGC AGGTTCTGTC 
GCACAAGGTG TTTCTGGCGA CATGCTTGAT 
GATCAATTTA AAAAAGCAGC TGACGATGCT 
GCCGAAGATG CAGCAGAAGA TATCTATATT 
GAAACTGTTT CTGCTGGTGT CGATGAAGCA 
GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT 
GGGTAA 



GGCGGATTTT TCTTAGGNGC AGTAATTGGT 
CTTGCACCAA AATCAGGTAA AGAATTACGT 
AAAAACAAAG CGCAAGATTA CACAGATTAT 
ATCGCAAAAC AAAAAGCCGG CGTTTTATCA 
AAAGAAAAAA CAAAAGATTC ATTGGATAAA 
AACTTTAAAA AACAAAGAGG TGATTTATCT 
CAAGATCACG CAGAAGATTT AGGTGAAATT 
GACGTTAAAG ATTCTGCGGC AGCGGCCAAA 
ANAGAAACCA CCAAAGATGT TCCTGAAAAA 
GCAGCGAAAG ACGTAAAAAA AGAATTTAAA 



EF091-2 (SEQ ID NO:346) 



MAKKG GFFLGAVIGG TAAAVAALLL APKSGKELRD DLSNQTDDLK NKAQDYTDYA 
VQKGTELTEI AKQKAGVLSD QASDLAGSVK EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA 
AEAKEDVKDA AKDVKKEFKG 



EF091-3 (SEQ ID NO:347) 



AT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 
GCTGTTCAAA AAGGAACAGA ATTAACAGAA ATCGCAAAAC AAAAAGCCGG CGTTTTATCA 
GATCAAGCCT CTGATTTGGC AGGTTCTGTC AAAGAAAAAA CAAAAGATTC ATTGGATAAA 
GCACAAGGTG TTTCTGGCGA CATGCTTGAT AACTTTAAAA AACAAACAGG TGATTTATCT 
GATCAATTTA AAAAAGCAGC TGACGATGCT CAAGATCACG CAGAAGATTT AGGTGAAATT 
GCCGAAGATG CAGCAGAAGA TATCTATATT GACGTTAAAG ATTCTGCGGC AGCGGCCAAA 
GAAACTGTTT CTGCTGGTGT CGATGAAGCA ANAGAAACCA CCAAAGATGT TCCTGAAAAA 
GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT GCAGCGAAAG ACGTAAAAAA AGAATTTAAA 
GGGTAA 



EF091-4 (SEQ ID NO:348) 

SNQTDDLK NKAQDYTDYA 
VQKGTELTEI AKQKAGVLSD QASDLAGSVK 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID 
AEAKEDVKDA AKDVKKEFKG • 



EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD 
VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA 



EF092-1 (SEQ ID NO:349) 



TAAGGGGATG AAGAAAAAAT GGCAAAAAAA 
AGCACGAGTT TATTAGTAAC AAAAATGCAA 
GACATCTTTG CAGTATCGGC TTCTGAAGCA 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC 
AAAGTTTTAG ATCAAGCAAT CTCATTAATG 



ACAATTATGT TAGTTTGTTC CGCAGGAATG 
AAAGCAGCAG AAGATCGTGG CATGGAAGCA 
GATACAAACT TGGAAAATAA AGAGGTGAAT 
ATGAAAGGGC AATTTGAACA AAAATTACAA 
ATGGCAGATT ATGGCATGAT GAATGGCGAA 
GGATAA 



EF092-2 (SEQ ID NO:350) 



MAKKT IMLVCSAGMS TSLLVTKMQK AAEDRGMEAD IFAVSASEAD TNLENKEVNV 
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LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK VLDQAISLMG 
EF092-3 (SEQ ID NO:351) 
AG AAGATCGTGG CATGGAAGCA 

GACATCTTTG CAGTATCGGC TTCTGAAGCA GATACAAACT TGGAAAATAA AGAGGTGAAT 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC ATGAAAGGGC AATTTGAACA AAAATTACAA 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC ATGGCAGATT ATGGCATGAT GAATGGCGAA 
AAAGTTTTAG ATCAAGCAAT CTCATTAATG GGAT 

EF092-4 (SEQ ID NO:352) 



EDRGMEAD IFAVSASEAD TNLENKEVNV 

LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK VLDQAISLMG 



EF093-1 (SEQ ID NO:353) 



TAGTTTTTTT CCGATAAAGG GAGAATTTTA ATGAGGCAAA AATATTCAGG AAACTTATTG 
TTCACGGCCA TGGCCATTGT TTATTTGATG AGTTTTCTCG CCCTTCAGTT ACTAGAAGAA 
CGTCAGTTAA CACAAAAATT TACGCAAGCT ACCCAGGAAT ACTATGCAGG GAAAAGTATC 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAGACGAA AGTTAAAAAC AGAAGAAAGG 
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACATACAAAA ATGAACAATT AAGAATAACT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAATATCAAG AGAGAGTTTC TCATCAAAAA 
AAAGCGGAAA CAATACTGGA ATAG 

EF093-2 (SEQ ID NO:354) 

M RQKYSGNLLF TAMAIVYLMS FLALQLLEER QLTQKFTQAT QEYYAGKSIF 
HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK 
AETILE 



EF093-3 (SEQ ID NO:355) 

CCTTCAGTT* ACTAGAAGAA 
CGTCAGTTAA CACAAAAATT TACGCAAGCT 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA 
CTCGTATACG CGCAAGTGAC CCTCGATTAT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC 
AAAGCGGAAA CAATACTGG 



ACCCAGGAAT ACTATGCAGG GAAAAGTATC 
AATAGACGAA AGTTAAAAAC AGAAGAAAGG 
ACATACAAAA ATGAACAATT AAGAATAACT 
CAATATCAAG AGAGAGTTTC TCATCAAAAA 



EF093-4 (SEQ ID NO:356) 
LQLLEER QLTQKFTQAT QEYYAGKSIF 

HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK 
AETI 



EF094-1 (SEQ ID NO:357) 



TAAACATTTG AGACATTCAG 
ATTAATTATA TGAAAAAAAC 
TTAAGTCAAA CAATTGGCGG 
CACCCTCAAG AGGTAACAAT 
TTTAGCGATG GCAGCACGCT 
GCAAAACAAA CAGTATTTTG 



AGGTGAATGT CTCTTTTTTA 
AACATTTAAA AATTGGTCGT 
AACGATTGGT CCTACGATTG 
TCATTATGAC GTAAGTAAAC 
CTCAGAACGT ACTACGTCAT 
TATTGAACCA GGTGTTAGTA 



TTACTCAAAA ACGAAAGGGG 
TATTTGCGAC TTTGGCTCTA 
CTTTTGCCGA TGAAATTACT 
TGTATGAAGT TGACGGAACT 
TATATGCAGA ATACAATGGT 
TTCCAACAGA AGTGACGCAC 
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GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACGACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG 
ACTTTGGCTT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA CATTCCCGCA AAC TGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC 
TTTATCTTGA TTTTTTCGAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA 

EF094-2 (SEQ ID NO: 358) 



MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG 



VSIPTEVTHG 
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YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YWAQAIDVE ATKAAQEKDE KAKPWIAET 
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF 
ILIFSTAGYY FWNRRN 

EF094-3 (SEQ ID NO: 359) 

CGA TGAAATTACT 

CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAG 

EF094-4 (SEQ ID NO:360) 

DEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
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SLKIKKIDKE SGDIVPETVF HLDFGKALPS 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK 
RKDSPAGEIV QEITTDEKGR AETPKELANA 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK 
SEAFKTELVK GTKASDET 

EF095-1 (SEQ ID NO:361) 



KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 



TAAGAATTGT TGGATTGTTC TTTAGAAAGA AGGGACAATA TGAAGCGAAG TAAATGGAAA 
GAATTGATAG TAACGGGCAT CTGCCATATA TTAGTATTCC CCATACTAAT ACAGACAACT 
GTTTTTGCAG AAACATTACC AAGTACAAAA CAAGTAAGAG AAGGAACCAA TCATTCATTA 
ACAGCAGAAA AAGCCGAAAG TGAACAACCA CAGACAAAGG ATAAACTACA TGATGAAGAA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC GATAATGAGG CTAATGTTAC AAGTCAAACG 
ATTAGAGAAA GAATTGAGAC GCCTAACCTA ACTTATCGTT ATGGATTTAT TAATGAAGAG 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT CTACAGTATC ATAGTTGGCA AGGCAATTCC 
CCAGATGGCA TAAATGTGTG GGAAGGTGAA AGTCAACCAG TGACAGCATC TACAGTGGCT 
AATTTAAAAG AAGTGGTAAT TCCAAGTGAG AAAGTAGCCG TCTATTCCGA CATGTCAACG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT TTACCAAGAT ATTATAC TTC TTTAAGCTTA 
TACAATAAGA AAGGGGAAAT TGATCCCAAT TATCCGCTGC CAACTATTTC CGACGCATCA 
GGAAACCAAT ATCCAACAAC AATTTCGCAA TTTGAATTGG AAAAAATGTC TGCACAACAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC ATTAGCGAGA GTCAAAAACT AATCGTTCCT 
TTGTACAACC AAGTGAAGGT TGATTCATCG AATCAATCTG GGCTATTGAA TTACTTTAAA 
TTTTCAGGGC CGGTTTATTA TCATGTTACC AATCGCAAAG TGACAGAACA TTTTGTGGAT 
ACTCAAGGGA AACCAATCCC TCCACCACCG GGGTTTAGAC AAGGAAAGCA AACACTTATT 
GAGCGTGACC CTTACACCTT TAAACAGAAA GATCTTTTGC CAAGTAGCTA TGAAATTGAC 
TCAAAAACGT ATCAATTTCA AGGATGGTAT AAAGGGAAAA CGAAACCTGA AAATTTAGAA 
AAAAGCGTAA CGCCCAGTTA TGATATTACC TATGACGACA ATGATGATTT AACTGTTGTC 
TATAAGGAGA TACCTCAAAA AAATTATACA TTTGAGGATG TCAATGGTGT TGAAATTGCA 
CCACCATCTG ATTTTATTCA GGATCACCAA CAACCAATAA CTACGGATGG CTTTCGCTAT 
TTAGCTGGAA AAAAACTGCC ACAACAATAC AGCGTTAACG GTAAAACTTA TTTATATCAA 
GGTTGGTATC AAGATAAAAC NAAACAAGAG AGCTTAGAAA AAACGAAGCG ACCCATAAAC 
TCCCCTGTTT TTAATGAAAT GAACGCTATT ACAGCAGTGT ATAAGGAAAT AACTGCAAAA 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA GTCATGCCAA GTGGTTATAT ACAAATTTGG 
CAGATTATGC TTACAAATGT GGGAGAAGTA CCGTTAAAAA AAATAAACTT AAAGCCAGCA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA ATCCAAGTCA CGATTCGTGT TGGATCTGAA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA AATTGGCGAG TTGGCATTAC TTTAAATACG 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT ATGATGACAA CAATTGCTAC AGGTGAACCA 
GATCAAGTGT TACAAGCGGC TGTTGAAATG AATGGAAATT TTTCTGCTGT TCACGCAGCT 
GATACTGTCA GAATCCAACC TAAAAATCAA GAAATTGTGG CACCAGATGA GGAAGGTTTT 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA GTCGCCATTT CTAGCAACAC GCAGCAACAT 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA AATGGTCAGG AAAATCCATA TTTACGTTTG 
AAAAAATCAC AACCCAATTG GGCACTAACT GCAGAACTAT CCCCCTTTGA AGGAAGAGTG 
GATCAACTAT CATCAATGAC AAAGTTATTG TTAGGAACAA CCAATGTTTC AGGTTTTATT 
CAGTACAATC AACCAACGGA AACTAAAGTT GCTCTTGGCA AAACAACCGC TATTCAATTA 
GTTGCCAACG GTGTAGCTAG CCATATTGTT GCCAATGGTC AGTTTGACGA AAGTGATGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC AAATTAGAAA TTCCAGCAAA TCAAGGTAGA 
AAAGATCAAA CTTATCAAGC AATGGTGACT TGGAATTTAG TGACAGGCCC ATAA 



EF095-2 (SEQ ID NO:362) 

MKRSKWKE LIVTGICHIL VFPILIQTTV FAETLPSTKQ VREGTNHSLT 

AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFINEEG 

QPVNANEILL QYHSWQGNSP DGINVWEGES QPVTASTVAN LKEWIPSEK VAVYSDMSTV 
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LAASNQTFFL PRYYTSLSLY NKKGEIDPNY PLPTISDASG NQYPTTISQF ELEKMSAQQY 
SQKTGVTFNI SESQKLIVPL YNQVKVDSSN QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT 
QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD LLPSSYEIDS KTYQFQGWYK GKTKPENLEK 
SVTPSYDITY DDNDDLTWY KEIPQKNYTF EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL 
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES LEKTKRPINS PVFNEMNAIT AVYKEITAKA 
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKINLKPAS GWSPGLARPI QVTIRVGSEP 
NKIVPITDEN WRVGITLNTE VPIGQTATIM MTTIATGEPD QVLQAAVEMN GNFSAVHAAD 
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AISSNTQQHG LKQAADYYEN GQENPYLRLK 
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV 
ANGVASHIVA NGQFDE3DVY QFDFSFDQIK LEIPANQGRK DQTYQAMVTW NLVTGP 

EF095-3 (SEQ ID NO:363) 

AAGTACAAAA CAAGTAAGAG AAGGAACCAA TCATTCATTA 

ACAGCAGAAA AAGCCGAAAG TGAACAACCA CAGACAAAGG ATAAACTACA TGATGAAGAA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC GATAATGAGG CTAATGTTAC AAGTCAAACG 
ATTAGAGAAA GAATTGAGAC GCCTAACCTA ACTTATCGTT ATGGATTTAT TAATGAAGAG 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT CTACAGTATC ATAGTTGGCA AGGCAATTCC 
CCAGATGGCA TAAATGTGTG GGAAGGTGAA AGTCAACCAG TGACAGCATC TACAGTGGCT 
AATTTAAAAG AAGTGGTAAT TCCAAGTGAG AAAGTAGCCG TCTATTCCGA CATGTCAACG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT TTACCAAGAT ATTATACTTC TTTAAGCTTA 
TACAATAAGA AAGGGGAAAT TGATCCCAAT TATCCGCTGC CAACTATTTC CGACGCATCA 
GGAAACCAAT ATCCAACAAC AATTTCGCAA TTTGAATTGG AAAAAATGTC TGCACAACAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC ATTAGCGAGA GTCAAAAACT AATCGTTCCT 
TTGTACAACC AAGTGAAGGT TGATTCATCG AATCAATCTG GGCTATTGAA TTACTTTAAA 
TTTTCAGGGC CGGTTTATTA TCATGTTACC AATCGCAAAG TGACAGAACA TTTTGTGGAT 
ACTCAAGGGA AACCAATCCC TCCACCACCG GGGTTTAGAC AAGGAAAGCA AACACTTATT 
GAGCGTGACC CTTACACCTT TAAACAGAAA GATCTTTTGC CAAGTAGCTA TGAAATTGAC 
TCAAAAACGT ATCAATTTCA AGGATGGTAT AAAGGGAAAA CGAAACCTGA AAATTTAGAA 
AAAAGCGTAA CGCCCAGTTA TGATATTACC TATGACGACA ATGATGATTT AACTGTTGTC 
TATAAGGAGA TACCTCAAAA AAATTATACA TTTGAGGATG TCAATGGTGT TGAAATTGCA 
CCACCATCTG ATTTTATTCA GGATCACCAA CAACCAATAA CTACGGATGG CTTTCGCTAT 
TTAGCTGGAA AAAAACTGCC ACAACAATAC AGCGTTAACG GTAAAACTTA TTTATATCAA 
GGTTGGTATC AAGATAAAAC NAAACAAGAG AGCTTAGAAA AAACGAAGCG ACCCATAAAC 
TCCCCTGTTT TTAATGAAAT GAACGCTATT ACAGCAGTGT ATAAGGAAAT AACTGCAAAA 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA GTCATGCCAA GTGGTTATAT AGAAATTTGG 
CAGATTATGC TTACAAATGT GGGAGAAGTA CCGTTAAAAA AAATAAACTT AAAGCCAGCA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA ATCCAAGTCA CGATTCGTGT TGGATCTGAA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA AATTGGCGAG TTGGCATTAC TTTAAATACG 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT ATGATGACAA CAATTGCTAC AGGTGAACCA 
GATCAAGTGT TACAAGCGGC TGTTGAAATG AATGGAAATT TTTCTGCTGT TCACGCAGCT 
GATACTGTCA GAATCCAACC TAAAAATCAA GAAATTGTGG CACCAGATGA GGAAGGTTTT 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA GTCGCCATTT CTAGCAACAC GCAGCAACAT 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA AATGGTCAGG AAAATCCATA TTTACGTTTG 
AAAAAATCAC AACCCAATTG GGCACTAACT GCAGAACTAT CCCCCTTTGA AGGAAGAGTG 
GATCAACTAT CATCAATGAC AAAGTTATTG TTAGGAACAA CCAATGTTTC AGGTTTTATT 
CAGTACAATC AACCAACGGA AACTAAAGTT GCTCTTGGCA AAACAACCGC TATTCAATTA 
GTTGCCAACG GTGTAGCTAG CCATATTGTT GCCAATGGTC AGTTTGACGA AAGTGATGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC AAATTAGAAA TTCCAGCAAA TCAAGGTAGA 
AAAGATCAAA CTTATCAAGC AATGGTGACT TGGAATTTAG TGACAGGCCC A 

EF095-4 (SEQ ID NO:364) 



STKQ VREGTNHSLT 
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TABLE 1. Nucleotide and Amino Acid Se 

AEKAESEQPQ TKDKLHDEET LALSKSELID 
QPVNANEILL QYHSWQGNSP DGINVWEGES 
LAASNQTFFL PRYYTSLSLY NKKGEIDPNY 
SQKTGVTFNI SESQKLIVPL YNQVKVDSSN 
QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD 
SVTPSYDITY DDNDDLTWY KEIPQKNYTF 
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES 
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP 
NKIVPITDEN WRVGITLNTE VPIGQTATIM 
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV 
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL 
ANGVASHIVA NGQFDESDVY QFDFSFDQIK 

EF096-1 (SEQ ID NO:365) 

TGAGGTGGCC AAGTTAAAAT GAAAAAATTA 
ATTGTCATCT TGTTTTTTGG CGTGCGCCAA 
GATACCTTGA CCATTTACAA TTGGGGGGAC 
GAAAAAGAAA CAGGCTATAA AGTCAATTAC 
ACAAAAATTC AGCAAGGTGG CACAGCCTAT 
CAAAAAATGA TGAAAGCGAA GATGCTTTTA 
GAAAACATTG ATGCACGCTT TTTAGATCAA 
CCGTACTTCT GGGGCACGTT GGGGATTATT 
ATCCAACATT GGGATGATTT ATGGCGCCCG 
GGCGCTCGCG AAGTGTTAGG ATTATCTTTG 
AACGACCAAC AATTACGTCA GGCTACCGAT 
GCAATTGTTG CCGATGAAAT CAAAATGTAC 
ACTTTCTCTG GTGAAGCTGC TGAAATGCTA 
CCCAGTGAAG GCTCTAATCT CTGGTTTGAT 
AAAGAGGGTG CCTATGCATT TATGAACTTT 
GCAGAATATA TTGGTTATTC CACACCAAAT 
GTTGCCGAAG ATAAACAATT TTATCCAGAT 
CAAGACTTAG GTCAAGAATA CTTAGGAATT 
TATCGGAAAT AA 
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NEANVTSQTI RERIETPNLT YRYGFINEEG 
QPVTASTVAN LKEWIPSEK VAVYSDMSTV 
PLPTISDASG NQYPTTISQF ELEKMSAQQY 
QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT 
LLPS S YE IDS KTYQFQGWYK GKTKPENLEK 
EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL 
LEKTKRPINS PVFNEMNAIT AVYKEITAKA 
LKKINLKPAS GWSPGLARPI QVTIRVGSEP 
MTTIATGEPD QVLQAAVEMN GNFSAVHAAD 
AISSNTQQHG LKQAADYYEN GQENPYLRLK 
GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV 
LEIPANQGRK DQTYQAMVTW NLVTGP 



CAGTCACTTT TTATTGGAAT TATCGCTATT 
TTGGAGAAAG CAAGTGGCAT GGCAGGAGCA 
TATATAGATC CGGCCTTGAT TAAGAAATTT 
GAAACCTTTG ATTCTAATGA AGCTATGTAT 
GATATTGCCA TTCCTTCTGA ATATATGATT 
CCACTTGATC ACAGCAAATT AAAAGGCTTA 
TCCTTTGATC CCAAAAATAA GTTTTCCGTT 
TATAATGATA AATTTATTGA CGGCCGTCAG 
GAATTAAAAA ATAATGTCAT GCTGATTGAT 
AACAGTTTAG GCTATTCGTT AAACAGTAAA 
AAGTTAAACC GATTAACGAA CAATGTCAAA 
ATGGCTAATG AAGAAAGTGC AGTTGCTGTA 
GAAAACAATG AACATCTACA TTATGTGATT 
AACATTGTGA TGCCTAAGAC AGCCAAAAAT 
ATGTTACGAC CAGAAAATGC GGCACAAAAT 
AAAGAAGCTA AAAAACTATT ACCAAAAGAA 
GATGAAACTA TCAAACATTT AGAAGTTTAC 
TATAACGATC TGTTCTTGGA GTTTAAGATG 



EF096-2 (SEQ ID NO:366) 



MKKLQ SLFIGIIAII VILFFGVRQL EKASGMAGAD TLTIYNWGDY IDPALIKKFE 
KETGYKVNYE TFDSNEAMYT KIQQGGTAYD IAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE 
NIDARFLDQS FDPKNKFSVP YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK LNRLTNNVKA IVADEIKMYM ANEESAVAVT 
FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN IVMPKTAKNK EGAYAFMNFM LRPENAAQNA 
EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD ETIKHLEVYQ DLGQEYLGIY NDLFLEFKMY 
RK 

EF096-3 (SEQ ID NO:367) 



AAGTGGCAT GGCAGGAGCA 
GATACCTTGA CCATTTACAA TTGGGGGGAC 
GAAAAAGAAA CAGGCTATAA AGTCAATTAC 
ACAAAAATTC AGCAAGGTGG CACAGCCTAT 
CAAAAAATGA TGAAAGCGAA GATGCTTTTA 
GAAAACATTG ATGCACGCTT TTTAGATCAA 
CCGTACTTCT GGGGCACGTT GGGGATTATT 



TATATAGATC CGGCCTTGAT TAAGAAATTT 
GAAACCTTTG ATTCTAATGA AGCTATGTAT 
GATATTGCCA TTCCTTCTGA ATATATGATT 
CCACTTGATC ACAGCAAATT AAAAGGCTTA 
TCCTTTGATC CCAAAAATAA GTTTTCCGTT 
TATAATGATA AATTTATTGA CGGCCGTCAG 
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ATCCAACATT GGGATGATTT ATGGCGCCCG GAATTAAAAA ATAATGTCAT GCTGATTGAT 
GGCGCTCGCG AAGTGTTAGG ATTATCTTTG AACAGTTTAG GCTATTCGTT AAACAGTAAA 
AACGACCAAC AATTACGTCA GGCTACCGAT AAGTTAAACC GATTAACGAA CAATGTCAAA 
GCAATTGTTG CCGATGAAAT CAAAATGTAC ATGGCTAATG AAGAAAGTGC AGTTGCTGTA 
ACTTTCTCTG GTGAAGCTGC TGAAATGCTA GAAAACAATG AACATCTACA TTATGTGATT 
CCCAGTGAAG GCTCTAATCT CTGGTTTGAT AACATTGTGA TGCCTAAGAC AGCCAAAAAT 
AAAGAGGGTG CCTATGCATT TATGAACTTT ATGTTACGAC CAGAAAATGC GGCACAAAAT 
GCAGAATATA TTGGTTATTC CACACCAAAT AAAGAAGCTA AAAAACTATT ACCAAAAGAA 
GTTGCCGAAG ATAAACAATT TTATCCAGAT GATGAAACTA TCAAACATTT AGAAGTTTAC 
CAAGACTTAG GTCAAGAATA CTTAGGAATT TATAACGATC TGTTCTTGGA GTTTAAGATG 
TATCGGAAA 



EF096-4 (SEQ ID NO:368) 

SGMAGAD TLTIYNWGDY IDPALIKKFE 
KETGYKVNYE TFDSNEAMYT KIQQGGTAYD 
NIDARFLDQS FDPKNKFSVP YFWGTLGIIY 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK 
FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN 
EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD 
RK 

EF097-1 (SEQ ID NO:369) 



IAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE 
NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG 
LNRLTNNVKA IVADEIKMYM ANEESAVAVT 
IVMPKTAKNK EGAYAFMNFM LRPENAAQNA 
ETIKHLEVYQ DLGQEYLGIY NDLFLEFKMY 



TAGAAGTATT CTAATTATCT ACATAGAGAG CGAGGGACAA GGAATATGAA GGAAAAAGAA 
ATGCATTCGC TCTTTTTTAA ACATAAGTTT GTGAAAGTAA CTCCCTATTT ACGTCGTTTT 
GGTCATCGTT TGAGTGGGAT GATTATGCCA AATTTGAGTA TTTTTATTGC GTGGAGCTTA 
TTGTCTTTGG TGGCTGGCTA TACGACTGGG AATCTACGGC TAGCTCTTTC TGAAGTCGAA 
ACGATAATGA TTCGAGTTGT TTTACCGATT CTAATTGGTT TTACAGGCGG AAAAATGTTC 
GAGGAACAAC GTGGCGGCGT TGTTGCTGCT ATTGCGACAG TGGGCGTGAT TGTTTCCACA 
GATGTTCCAC AGTTGTTTGG TGCTATGTTT ATTGGCCCTT TAGCAGGATA TACTTTCGCC 
AAAATTGAAC AAATTCTCTT ACCGAAAGTT AAAGAAGGCT ACGAGATGCT GACTAAAAAC 
TTTTTAGCAG GAATTGTGGG AGGACTGCTG TGCTGTTTTG GTATTCTGGT TGTAGCTCCG 
GCTGTTGAAA GCGCTAGTTT TTGGCTGTAT CAATTTTCTT CTTGGTTAAT TGAAGCCAAT 
CTTTTACCAT TGGTTCACGT TTTC TT AG AG CCCTTAAAAG TGTTATTTTT TAATAATGCG 
ATTAACCATG GCTTATTAAC GCCTCTAGGT TTAGAAGGTG CTAGTCAAAC AGGTCAGTCC 
ATTTTATTTC TATTGGAAAC AAACCCTGGA CCAGGCGTGG GCGTTTTGGT TGCTTTTCTG 
CTGTTTGGGC CTGTAGGACA ACGAAAAACA GCAGGAGGTG CCACCATGAT TCAACTGATT 
GGGGGCATTC ATGAAATTTA TTTTCCGTTT GTTTTGATGG ACCCGCGCTT ATTTTTAGCA 
GTAATTGCTG GAGGAATGAG TGGTACGCTT GTTTTTCAAA TATTTAATGT GGGTCTAAGT 
GCTCCAGCTT CGCCAGGTTC ATTGGTTGCG ATTTTAGCCA ATGCCCCGAC TGATGCGAGG 
CTGGCGGTTT TTAGCGGAAT TTTTGTTAGC TTTCTGTGCT CTTTTGCAAT AGCAAGCTTG 
TTATTAAAAC GTCAACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC 
CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA TGCAGGAATG 
GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT GAACTTGGAG 
ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC ATTAGTGGTC 
ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA TATGGTGAGT 
GTTCAAAATT TTTTAGAAAT TAAATCCTAT TACCCGCAAG TTTTAGCCAA ACTGACTGCT 
TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC GACAAAACAA 
ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAG GATCGCAAAC AATGGGAATG 
GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC TAAAGAGCCA 
CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA ACTGGCGCAA 
GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT TTTGAATAAG 
AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTTAA 
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EF097-2 (SEQ ID NO:370) 
MLTKNF LAGIVGGLLC CFGILWAPA 

VESASFWLYQ FSSWLIEANL LPLVHVFLEP LKVLFFNNAI NHGLLTPLGL EGASQTGQSI 
LFLLETNPGP GVGVLVAFLL FGPVGQRKTA GGATMIQLIG GIHEIYFPFV LMDPRLFLAV 
IAGGMSGTLV FQIFNVGLSA PASPGSLVAI LANAPTDARL AVFSGIFVSF LCSFAIASLL 
LKRQRGIEPV SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM 
PVTYQSVHQM KWQPKTLWI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY PQVLAKLTAS 
SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG VAIEVSKEPL 
ETVFFTKETT YWTRELAQA YHLDLTQQNL YWTSFLNKK EYQEWLEGGA DRCF 

EF097-3 (SEQ ID NO:371) 

ACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC 

CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA TGCAGGAATG 
GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT GAACTTGGAG 
ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC ATTAGTGGTC 
ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA TATGGTGAGT 
GTTCAAAATT TTTTAGAAAT TAAATCC TAT TACCCGCAAG TTTTAGCCAA ACTGACTGCT 
TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC GACAAAACAA 
ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAG GATCGCAAAC AATGGGAATG 
GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC TAAAGAGCCA 
CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA ACTGGCGCAA 
GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT TTTGAATAAG 
AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTT 

EF097-4 (SEQ ID NO:372) 

RGIEPV SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM 
PVTYQSVHQM KWQPKTLWI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY PQVLAKLTAS 
SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG VAIEVSKEPL 
ETVFFTKETT YWTRELAQA YHLDLTQQNL YWTSFLNKK EYQEWLEGGA DRCF 

EF098-1 (SEQ ID NO:373) 

TAAATGAAAA AGACAAAAGT AATGACATTG ATGGCAACCA CAACTTTAGG CGCACTGGCA 
CTTGTACCAA TGAGTGCATT AGCAGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGATT 
CAATTTGCAC CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT 
AAACCAATTA CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT 
CCGTTATCCA TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA 
AATATGACCT ACTATGCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA 
GGCCCAAACT TTGTTCAAGT CTCAGATAAT CGTGGGACTG AGACAGGTTG GACGCTAAAA 
GTAAAACAAA ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACAGC GGCCAAAGTA 
ACTTTAAGCA ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA 
GCTACGATTG AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA 
GAAGGTGCGG GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT 
ATTTCATTAG AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT 
ACTTGGACTT TGACAGATAC ACCTGCTAAC ACAGGAAACT AA 

EF098-2 (SEQ ID NO: 374) 



MKKTKVMTLM ATTTLGALAL VPMSALAVDG 
PITPVDPTDP TGPKPGTAGP LSIDYASSLS 



GEYQTNGAIQ FAPNTNPTNP 
FGEQTITSKN MTYYAETQKY 



VDPTNPDPDK 
KDNAGADQEG 
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PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRWSAS QSAKPTTAPA 
TIELNPTGAE SWMAAGDKE GAGTYLMSWG DS VDTAKTS I SLEVPGSTTK YAKKYTTTFT 
WTLTDTPANT GN 

EF098-3 (SEQ ID NO:375) 

AGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGATT 

CAATTTGCAC CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT 
AAACCAATTA CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT 
CCGTTATCCA TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA 
AATATGACCT ACTATGCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA 
GGCCCAAACT TTGTTCAAGT CTCAGATAAT CGTGGGACTG AGACAGGTTG GACGCTAAAA 
GTAAAACAAA ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACAGC GGCCAAAGTA 
ACTTTAAGCA ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA 
GCTACGATTG AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA 
GAAGGTGCGG GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT 
ATTTCATTAG AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT 
ACTTGGACTT TGACAGATAC ACCTGCTAAC ACAGGAAACT 

EF098-4 (SEQ ID NO:376) 

VDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK 

PITPVDPTDP TGPKPGTAGP LSIDYASSLS FGEQTITSKN MTYYAETQKY KDNAGADQEG 
PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRWSAS QSAKPTTAPA 
TIELNPTGAE SWMAAGDKE GAGTYLMSWG DS VDTAKTS I SLEVPGSTTK YAKKYTTTFT 
WTLTDTPANT GN 

EF099-1 (SEQ ID NO:377) 

TGATGTTGTA GAGGGCTGAT GAAATGTTTA TCAGTCTTCT TTTTATTGAA AGGAGAGATC 
ATGAAGAAAT TAGGCAAGGT TTTAATTGTT AGTTGTTTTA TTTTTATTCT TCCTTTTTTA 
TTATTTTTAG GTGTATTTTC TTCTAGTGAA AGCGGAGATT CTTCCCAGTT TCAGCCCGCT 
ACACCACAGG AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA TGGCGGAACG 
TTGCAGTTTG CTTCCGCTTG GATTGGCAAT ATGGAACATG AAAGTGGATT AAATCCTGCT 
AGAATTCAAA GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC GTTAGGCGGT 
TATGGAATTG GGTTAGGACA ATGGGATTCA GGACGAAGAG TTAATTTATT AAATTTTGCA 
AAAAGTCAAA AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC GTGGAATAAG 
GATGGTTCTG ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT GAATACACTT 
GCGGTAGATA TTTTGAAGCT GTGGGAACGA GCTGGAACAA AAGATGATCC CGCAGAACAA 
GTAAAAAGAA AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG TTCCATGGGC 
GGAGGTTCAG CCAATGTTGG TGGAGGAAAA ATTGATGCCT TGGAAAAAGT GATGGGGCAA 
ACTATTAATG GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA ACAAGGAGGT 
CTACAAATGA TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA TTATCCTTGG 
AGTTCAATTG GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA AGCAGGAGAT 
GTCATTAATT TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC TGGTGTAGTG 
GCAAGTGTTG AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA ACAAGGTCAA 
ATTGTTGCTA AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC CAGCATAGTA 
AGGAAATAG 

EF099-2 (SEQ ID NO:378) 

MKCLS VFFLLKGEIM KKLGKVLIVS CFIFILPFLL FLGVFSSSES GDSSQFQPAT 
PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHESGLNPAR IQSDLSFNSA IAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA 
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VDILKLWERA GTKDDPAEQV 
INGGQCYGLS AFFVEKQGGL 
INFGQGGVAT SIYGHTGWA 



KRKASANNWY KRLSTGSMGG 
QMMGTGHMFA SEIGNDYPWS 
SVEGKNKFTT YEQNAEQGQI 



GSANVGGGKI DALEKVMGQT 
SIGWTVIKNP NYSDIKAGDV 
VAKYFRTWGL DFPHVTSIVR 



EF099-3 (SEQ ID NO:379) 

TAGTGAA AGCGGAGATT CTTCCCAGTT TCAGCCCGCT 

ACACCACAGG AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA TGGCGGAACG 
TTGCAGTTTG CTTCCGCTTG GATTGGCAAT ATGGAACATG AAAGTGGATT AAATCCTGCT 
AGAATTCAAA GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC GTTAGGCGGT 
TATGGAATTG GGTTAGGACA ATGGGATTCA GGACGAAGAG TTAATTTATT AAATTTTGCA 
AAAAGTCAAA AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC GTGGAATAAG 
GATGGTTCTG ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT GAATACACTT 
GCGGTAGATA TTTTGAAGCT GTGGGAACGA GCTGGAACAA AAGATGATCC CGCAGAACAA 
GTAAAAAGAA AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG TTCCATGGGC 
GGAGGTTCAG CCAATGTTGG TGGAGGAAAA ATTGATGCCT TGGAAAAAGT GATGGGGCAA 
ACTATTAATG GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA ACAAGGAGGT 
CTACAAATGA TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA TTATCCTTGG 
AGTTCAATTG GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA AGCAGGAGAT 
GTCATTAATT TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC TGGTGTAGTG 
GCAAGTGTTG AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA ACAAGGTCAA 
ATTGTTGCTA AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC CAGCATAGTA 
AGGAAAT 

EF099-4 (SEQ ID NO:380) 



SES GDSSQFQPAT 

PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHESGLNPAR IQSDLSFNSA IAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA LQMDFAWNKD GSD5DLLKRM SKSKDVNTLA 
VDILKLWERA GTKDDPAEQV KRKASANNWY KRLSTGSMGG GSANVGGGKI DALEKVMGQT 
INGGQCYGLS AFFVEKQGGL QMMGTGHMFA SEIGNDYPWS SIGWTVIKNP NYSDIKAGDV 
INFGQGGVAT SIYGHTGWA SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR 
K 

EF100-1 (SEQ ID NO:381) 

TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA AACAAAAATA CGCAGGGTTT 
ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG TATTGATTTT ACTTTTTGTC 
CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 
ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAATGA 

EF100-2 (SEQ ID NO:382) 

MKKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK 
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 



EF100-3 (SEQ ID NO:383) 



TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
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CAAT 

EF100-4 (SEQ ID NO:384) 
KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 
EF100-1 (SEQ ID NO:385) 

TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA AACAAAAATA CGCAGGGTTT 
ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG TATTGATTTT ACTTTTTGTC 
CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 
ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAATGA 

EF100-2 (SEQ ID NO:386) 

MKKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK 
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EF100-3 (SEQ ID NO:387) 

TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAAT 

EF100-4 {SEQ ID NO:388) 
KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 
EF101-1 (SEQ ID NO:389) 

TGAGGAGATG AAACGAAGAA AATGAAGAAG AAAACGATAA TTATATTGGG GGCAGTTGCG 
GTAATTGCGG TTGGGGGCAT CGTAACTGTG AATGCGTTAA ATAAAAATGC ACAACAAGTA 
GCTGTCAAGC AAGCGCCTAA AGATGACTGG GGAATTGACT ATTTTGACGT TCCCGACTTG 
CAACAAATTT ATATTAACGG TGTCATCCAA CCGGAACAAA TGGAAGCCTT TGCGCGTGAT 
CAAAAAATAA CAAAGGATCC AGAGATTAAG GTGAAAAACG GCGATGTCGT AGATGCAGGC 
ACAGAATTAT TTACTTATGA AGATGAGGCG GTCACAAAAG AAATTGAGGC ACAACAAAAT 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG AATATCTATA ATAAGTGGAA TCGGGCCATT 
GATAAATTTA ATAAAACTAA AGAAGAAGAC CGCACGATGT CTGGTGATGA TTTAAATGAA 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT GAAGAGATTA CCTTCACCAA TGAAACCTTA 
GCGGATTTAG GAGCGAAGCA ATATATTTCC ACAAAGGCTA ATTTCAAAGG TCGTGTATCA 
ATTCCAGAAG TAAAAGATGC CAATTCACCG ATTTTACGGT TAACTTCAGA AGATCTTTAT 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG ACTAAAATTA GTGTTGGGCA AAAAGCTAAA 
CTAACTTCTG TTTCCAACAA TGTGGTTGTG GATGGCTCAA TTTCTTACAT CGATGATAAT 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT GGCAATCCAG AGGGCGGCAC AACGATGTCT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT TTAGACAAAG TCAAAAATGG CTACCATATG 
CAAGCAACCA TTGATTTAGG CGATTTAGGG GCGATTGAGT TACCGAAAAA AGCGATTCAA 
AAAGAGGGTG AACAGGCCTA CGTTTTAGTG AATGATTTTG GAACCATCAT TCGTCGTGAT 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA ATGGCGATTG AATCTGGCTT AGAATCAGCC 
GACCGAGTGG TTATTTCTTC AAAAAAACCA GTAAAAGTCG GTGATATTGT TGAATCAGAT 
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GCAGCGATTG CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC GTCGAAATAG 



EF101-2 (SEQ ID NO:390) 



MKKK TIIILGAVAV IAVGGIVTVN ALNKNAQQVA VKQAPKDDWG IDYFDVPDLQ 
QIYINGVIQP EQMEAFARDQ KITKDPEIKV KNGDWDAGT ELFTYEDEAV TKEIEAQQNS 
LAKLETKRAN IYNKWNRAID KFNKTKEEDR TMSGDDLNEQ YQTEVDAVDE EITFTNETLA 
DLGAKQYIST KANFKGRVSI PEVKDANSPI LRLTSEDLYL AGKVNEKDLT KISVGQKAKL 
TSVSNNVWD GSISYIDDNP PEGNSDAASG NPEGGTTMSS YSVKIALANL DKVKNGYHMQ 
ATIDLGDLGA IELPKKAIQK EGEQAYVLVN DFGTIIRRDV QVGQENGDKM AIESGLESAD 
RWISSKKPV KVGDIVESDA AIASDESATN ESMTDASK 



EF101-3 (SEQ ID NO:391) 

TAAAAATGC ACAACAAGTA 
GCTGTCAAGC AAGCGCCTAA AGATGACTGG 
CAACAAATTT ATATTAACGG TGTCATCCAA 
CAAAAAATAA CAAAGGATCC AGAGATTAAG 
ACAGAATTAT TTACTTATGA AGATGAGGCG 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG 
GATAAATTTA ATAAAACTAA AGAAGAAGAC 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT 
GCGGATTTAG GAGCGAAGCA ATATATTTCC 
ATTCCAGAAG TAAAAGATGC CAATTCACCG 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG 
CTAACTTCTG TTTCCAACAA TGTGGTTGTG 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT 
CAAGCAACCA TTGATTTAGG CGATTTAGGG 
AAAGAGGGTG AACAGGCCTA CGTTTTAGTG 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA 
GACCGAGTGG TTATTTCTTC AAAAAAACCA 
GCAGCGATTG CTTCTGATGA ATCAGCAACC 



GGAATTGACT ATTTTGACGT TCCCGACTTG 
CCGGAACAAA TGGAAGCCTT TGCGCGTGAT 
GTGAAAAACG GCGATGTCGT AGATGCAGGC 
GTCACAAAAG AAATTGAGGC ACAACAAAAT 
AATATCTATA ATAAGTGGAA TCGGGCCATT 
CGCACGATGT CTGGTGATGA TTTAAATGAA 
GAAGAGATTA CCTTCACCAA TGAAACCTTA 
ACAAAGGCTA ATTTCAAAGG TCGTGTATCA 
ATTTTACGGT TAACTTCAGA AGATCTTTAT 
ACTAAAATTA GTGTTGGGCA AAAAGCTAAA 
GATGGCTCAA TTTCTTACAT CGATGATAAT 
GGCAATCCAG AGGGCGGCAC AACGATGTCT 
TTAGACAAAG TCAAAAATGG CTACCATATG 
GCGATTGAGT TACCGAAAAA AGCGATTCAA 
AATGATTTTG GAACCATCAT TCGTCGTGAT 
ATGGCGATTG AATCTGGCTT AGAATCAGCC 
GTAAAAGTCG GTGATATTGT TGAATCAGAT 
AACGAATCAA TGACAGATGC GTCGAAAT 



EF101-4 (SEQ ID NO:392) 

KNAQQVA VKQAPKDDWG IDYFDVPDLQ 
QIYINGVIQP EQMEAFARDQ KITKDPEIKV 
LAKLETKRAN IYNKWNRAID KFNKTKEEDR 
DLGAKQYIST KANFKGRVSI PEVKDANSPI 
TSVSNNVWD GSISYIDDNP PEGNSDAASG 
ATIDLGDLGA IELPKKAIQK EGEQAYVLVN 
RWISSKKPV KVGDIVESDA AIASDESATN 

EF102-1 (SEQ ID NO:393) 
TAAACATTTG AGACATTCAG AGGTGAATGT 
ATTAATTATA TGAAAAAAAC AACATTTAAA 
TTAAGTCAAA CAATTGGCGG AACGATTGGT 
CACCCTCAAG AGGTAACAAT TCATTATGAC 
TTTAGCGATG GCAGCACGCT CTCAGAACGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA 
GGTTATCAGA AAAACCCTTT GCCATCAATG 
TGGGAAAAGG CTGGAACAGA TATTGATACA 
GAAGTGAACG GTTATAAACT CCATTCCATA 



KNGDWDAGT ELFTYEDEAV TKEIEAQQNS 
TMSGDDLNEQ YQTEVDAVDE EITFTNETLA 
LRLTSEDLYL AGKVNEKDLT KISVGQKAKL 
NPEGGTTMSS YSVKIALANL DKVKNGYHMQ 
DFGTIIRRDV QVGQENGDKM AIESGLESAD 
ESMTDASK 



CTCTTTTTTA TTACTCAAAA ACGAAAGGGG 
AATTGGTCGT TATTTGCGAC TTTGGCTCTA 
CCTACGATTG CTTTTGCCGA TGAAATTACT 
GTAAGTAAAC TGTATGAAGT TGACGGAACT 
ACTACGTCAT TATATGCAGA ATACAATGGT 
GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
AATATGGTTG CACAAAAGAT GATTTGGGAA 
AAAAGATTAG GTGGTGCTTC AGTTGATATA 
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AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGC TTCTG A TGAAACAGTG 
ACTTTGGCTT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGIT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA CATTCCCGCA AACTGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC 
TTTATCTTGA TTTTTTCGAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA 

EF102-2 (SEQ ID NO:394) 

MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHS IK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
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SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YWAQAIDVE ATKAAQEKDE KAKPWIAET 
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTE INYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF 
ILIFSTAGYY FWNRRN 

EF102-3 (SEQ ID NO:395) 

TT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA 

EF102-4 (SEQ ID NO:396) 

LDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 

IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YWAQAIDVE ATKAAQEKDE KAKPWIAET 
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTE INYE KDGNVNGKHN 
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EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VK 
EF103-1 (SEQ ID NO:397) 



TAAGATAGGT TTATCAAAGA AAAGGAGCGA 
ATTACTTTAG TAACATTAAG TACGTTACTT 
GATCATGCAG CCAATCCAAA TAGTGCTACA 
CAAACAAGAG GCGACAAGGC GACTAAGATT 
GTTTATGATG CTGCTGGTAA TGATTTAACG 
AAATATGATG GTGAAACCGG TTTTTACGAG 
GGTGACGAAG GAACATTTTT TGTGACAGGT 
ACACAAAATT ATCAAGCCGT AGTGGATTTA 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT 
TATCATGGGA AAAAATTAGC TTTTACAAAT 
AAAATTGTGA CAAATAAATC AGGGGATAAA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG 
GCGAAATTTG ATCCAAACAC AAGTAAATAT 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA 
ACGTATACTC GAATGGGTAA AGATAATGCT 
GAACCTTACC AAGGCACATA TCATCCAGCC 



TGCTTTATGA AAAAGAAAGT ATTAAGTTCG 
ATAGCAGGTT ATGCAAGTCC AGCATTTGCA 
GCAAATTTAG GCAAACATCA AAACAATGGC 
TTATCTGGCA CGGACTGGCA AGGAACCCGT 
GCAGAAAATG CTAATTTTAT TGGTTTAGCA 
TTTTTCGACA AAAATACTGG GGAAACCCGT 
GATGGCACAA AACGAATTTT AATTTCGCGG 
ACCGAAGTGA GTAAAGACNA ATTTACTTAC 
GATGTTGAAG TTTACGTGGA ACACATCCCT 
GGACGTGAAG CATTAACCAA 'TCAAACTGGC 
ATTTTAGGAA CAACCTTGTG GAATGGCACA 
ACAGCGGCCA ATCAAAATTT CATTAGTTTA 
GAATTTTTCA ATTTACAAAC AGGTGAAACC 
GACAATAACA AGATTCGGGC CCATGTATCT 
GAATTAACGG AACTAAACAA TGATCGATTT 
GGTAATGATA TTCAAGTGTT CGTGGAACAT 
TTTACTTTCT AA 



EF103-2 (SEQ ID NO:398) 

MKKKVLSSI TLVTLSTLLI AGYASPAFAD HAANPNSATA NLGKHQNNGQ 
TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YDGETGFYEF FDKNTGETRG 
DEGTFFVTGD GTKRILISRT QNYQAWDLT EVSKDXFTYK RLGKDKLGND VEVYVEHIPY 
HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK WDKNGNDVT AANQNFISLA 
KFDPNTSKYE FFNLQTGETR GDFGYFQWD NNKIRAHVSI GTNRYGAALE LTELNNDRFT 
YTRMGKDNAG NDIQVFVEHE PYQGTYHPAF TF 



EF103-3 (SEQ ID NO:399) 



TCATGCAG CCAATCCAAA TAGTGCTACA G( 
CAAACAAGAG GCGACAAGGC GACTAAGATT 
GTTTATGATG CTGCTGGTAA TGATTTAACG 
AAATATGATG GTGAAACCGG TTTTTACGAG 
GGTGACGAAG GAACATTTTT TGTGACAGGT 
ACACAAAATT ATCAAGCCGT AGTGGATTTA 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT 
TATCATGGGA AAAAATTAGC TTTTACAAAT 
AAAATTGTGA CAAATAAATC AGGGGATAAA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG 
GCGAAATTTG ATCCAAACAC AAGTAAATAT 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA 
ACGTATACTC GAATGGGTAA AGATAATGCT 
GAACCTTACC AAGGCACATA TCATCCAGCC 



lAAATTTAG GCAAACATCA AAACAATGGC 
TTATCTGGCA CGGACTGGCA AGGAACCCGT 
GCAGAAAATG CTAATTTTAT TGGTTTAGCA 
TTTTTCGACA AAAATACTGG GGAAACCCGT 
GATGGCACAA AACGAATTTT AATTTCGCGG 
ACCGAAGTGA GTAAAGACNA ATTTACTTAC 
GATGTTGAAG TTTACGTGGA ACACATCCCT 
GGACGTGAAG CATTAACCAA TCAAACTGGC 
ATTTTAGGAA CAACCTTGTG GAATGGCACA 
ACAGCGGCCA ATCAAAATTT CATTAGTTTA 
GAATTTTTCA ATTTACAAAC AGGTGAAACC 
GACAATAACA AGATTCGGGC CCATGTATCT 
GAATTAACGG AACTAAACAA TGATCGATTT 
GGTAATGATA TTCAAGTGTT CGTGGAACAT 
T 



EF103-4 (SEQ ID NO:400) 

HAANPNSATA NLGKHQNNGQ 
TRGDKATKIL SGTDWQGTRV YDAAGNDLTA 
DEGTFFVTGD GTKRILISRT QNYQAWDLT 



ENANFIGLAK YDGETGFYEF FDKNTGETRG 
EVSKDXFTYK RLGKDKLGND VEVYVEHIPY 
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HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK WDKNGNDVT AANQNFISLA 
KFDPNTSKYE FFNLQTGETR GDFGYFQWD NNKIRAHVSI GTNRYGAALE LTELNNDRFT 
YTRMGKDNAG NDIQVFVEHE PYQGTYHPA 

EF104-1 (SEQ ID NO:401) 

TGAAAGGGGA TTAGTATGAA GAAAAAAACT TTTTCTTTTG TGATGTTGAG TATACTTCTC 
GCACAAAATT TCGGGTTTGC CGTAAATGCC TATGCTGTAA CAACGACAGA AGCACAAACA 
GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC ATCTTTACCT 
TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA ATCGCAAACC 
ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGCTACAC CATCTQAACA ACAAACAACG 
GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC GCCAATTACA 
GATGAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA CAGCCAAACA 
GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT TGCTGTTGGA 
AACACAGGTT ATGCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA TTTAGGTTAT 
TCAACTGTCA GCGGAGAAGT GACAGGCATT GGCGCAACTT GGGCAGTCGA TGCGGCGACC 
AAAACATTAA GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA AGTAGAACTA 
AAAAGTTATC TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC AGGAAAAAAT 
AAAAAAACCT ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA TAACGAACGA 
ACCAGAACGA CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC GTTAACTGGC 
AATCAAACAT TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG AAAACAAGAT 
AACTTGGAAC CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT TTTACCAGAA 
ACGCAAACCT TGTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC ACTCGGGCGG 
ATTGCTGTAA CTGTTCCAAA CATGAATCAA CAAAAAGCCT ATTCCTTATC GATTAATCGG 
ACAATTTATT TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA GTATCCAACA 
ACAAAAATTG GGTCAATTTC TTTGAAAAGT ACGACAGGAA CTAAACAAAC AACCGATTTT 
ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA ATTGCTGATC GTGAAATGCG TAGTATGTCC 
TATATCAGTT TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC GTTAACAGAA 
ACAAAAGTGG GTCAACAAAT CGTATTAGAG AGTACAAACG GTCAAGAAAT TAAGAATCCT 
AAATTTACGG CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA TTTTGATATT 
AAAACTGAAG GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT AAGAATAAAT 
ATTTCTGATT TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT AAGTACACCT 
GTAATTGGTC CTAATAAAGC CATTCAATTA GTATCCGATC AATATATTGA ACCAATTAGT 
GTTGTTAATC CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA TGGTGCCTAT 
TCATCAAGAA CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA AAATTTAGAA 
ATTAAAGTAA AGCATCCTAA TTATCTTTCA TTACGAGCTA CAAAAGAAAT TTATTTTTAT 
TACAAGTTAG GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCAGT TATTAAGTTC 
ACTACGCCAA TAACGAACGA AATCCAAATT CCAATTGGTT TTAATTATGT GCCAGATAGT 
TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG TGCTGAAGGT 
TTAACTCCAG TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA ACGAACACTT 
CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA CTCACTAAGC 
GTCCGTACAA AAATTCCAGC TGGCGCCGAT GTTCTTTTTG ACATTTATGA TGTTTCAAAC 
GATCAGGTAG ATTCAATTTA TCCACAATAC TGGGACCGCG GTCAATACTT TGATAAACCA 
ATGACGCCAA ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC CAATAGTTAC 
ACGTTTGATT TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA CGCCAATGGC 
TGGATCGACG TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA ATCGAATAAT 
AATGAAGGCT CTGCTTCGGT TTCTGTTCAA AATGAAGCGT TAGACATTTT GAGTGCAACA 
CAAGCGGCGA ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC AAAAAATATT 
GATAATAAAA CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA AGGCACAACC 
AATGCTCAAA TCGATTTGAA TTCTATTACC GTGAAAGGCG TGCCAGAAGA TGCTTATTCA 
TTAGAGAAGA CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT GACAGAAAAC 
ATTACGATTG AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC AGAAACAACA 
ATCGACTCTG AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC CACTGCGCCA 
ATCACATTGA AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC AACTGCCACA 
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TTCTACACGC ATAACGTAGA GGATGAAAAC CAAGCAATTG CGAAGGTTTC TTTTGAACTA 
ATTGATAATG TCACGCATAC AGCAACCGAA TTTACAACAG ATGAAAAAGG TCAATACTCC 
TTTGATGCCA TCATGACAGG TGATTATACT TTGCGAGTAA CGAATGTACC GCAGGAATAT 
TCCGTGGATG AAGAGTATTT GACAGGAAAA GCCATTAAGC TGGTCAAAGG AGACAACCAA 
CTAAAAATTC CATTAACGAA AACAATTGAT CACAGTCGTT TACAAGTCAA AGATTCAACG 
ATTTATGTCG GCGATTCATG GAAACCAGAA GAGAACTTTG TTTCAGCAAC AGATAAAACA 
GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA 
GGCGTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA TGTGACCGTC 
AAACCCGACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG TGATTCGTGG 
AAACCAGAAG ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA 
AAAATTGATG TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT 
AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG 
G TTAAAG ATA CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA 
GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT 
GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA 
GCTAATGTGA CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT 
GTGGGCGATA AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA 
GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATGTTGATAA AATAGGCGAT 
TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT 
GACAGTCAGT TAGAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG GGAAGCAGAA 
GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT 
GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC 
ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT 
ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACAGAT 
AAAACAGGTC AAGATGTCCC ATTCGAAAAA ATCACTGTT 

EF104-2 (SEQ ID NO : 402 ) 

MKKKTF SFVMLSILLA QNFGFAVNAY AVTTTEAQTE TTDTAKKEAE LSNSTPSLPL 
ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA LPDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT VQLPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL IKIETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNMNQQ KAYSLSINRT 
IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN VKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV SDQYIEPISV 
VNPLiNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL RATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT IPITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY IIEYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT KTTVTTKNID 
NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL RVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE NFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD TTIYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSGQVD TSKAGVYPIV YSYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG TVNVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP EXNFVSATDK 
TGQDVPFEKI TV 
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EF104-3 (SEQ ID NO:403) 
TGTAA CAACGACAGA AGCACAAACA 

GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC ATCTTTACCT 
TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA ATCGCAAACC 
ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGCTACAC CATCTGAACA ACAAACAACG 
GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC GCCAATTACA 
GATGAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA CAGCCAAACA 
GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT TGCTGTTGGA 
AACACAGGTT ATGCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA TTTAGGTTAT 
TCAACTGTCA GCGGAGAAGT GACAGGCATT GGCGCAACTT GGGCAGTCGA TGCGGCGACC 
AAAACATTAA GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA AGTAGAACTA 
AAAAGTTATC TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC AGGAAAAAAT 
AAAAAAACCT ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA TAACGAACGA 
ACCAGAACGA CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC GTTAACTGGC 
AATCAAACAT TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG AAAACAAGAT 
AACTTGGAAC CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT TTTACCAGAA 
ACGCAAACCT TGTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC ACTCGGGCGG 
ATTGCTGTAA CTGTTCCAAA CATGAATCAA CAAAAAGCCT ATTCCTTATC GATTAATCGG 
ACAATTTATT TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA GTATCCAACA 
ACAAAAATTG GGTCAATTTC TTTGAAAAGT ACGACAGGAA CTAAAGAAAC AACCGATTTT 
ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA ATTGCTGATC GTGAAATGCG TAGTATGTCC 
TATATCAGTT TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC GTTAACAGAA 
ACAAAAGTGG GTCAACAAAT CGTATTAGAG AGTACAAACG GTCAAGAAAT TAAGAATCCT 
AAATTTACGG CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA TTTTGATATT 
AAAACTGAAG GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT AAGAATAAAT 
ATTTCTGATT TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT AAGTACACCT 
GTAATTGGTC CTAATAAAGC CATTCAATTA GTATCCGATC AATATATTGA AC C AATTAGT 
GTTGTTAATC CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA TGGTGCCTAT 
TCATCAAGAA CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA AAATTTAGAA 
ATTAAAGTAA AGCATCCTAA TTATCTTTCA TTACGAGCTA CAAAAGAAAT TTATTTTTAT 
TACAAGTTAG GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCAGT TATTAAGTTC 
ACTACGCCAA TAACGAACGA AATCCAAATT CCAATTGGTT TTAATTATGT GCCAGATAGT 
TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG TGCTGAAGGT 
TTAACTCCAG TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA ACGAACACTT 
CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA CTCACTAAGC 
GTCCGTACAA AAATTCCAGC TGGCGCCGAT GTTCTTTTTG ACATTTATGA TGTTTCAAAC 
GATCAGGTAG ATTCAATTTA TCCACAATAC TGGGACCGCG GTCAATACTT TGATAAACCA 
ATGACGCCAA ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC CAATAGTTAC 
ACGTTTGATT TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA CGCCAATGGC 
TGGATCGACG TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA ATCGAATAAT 
AATGAAGGCT CTGCTTCGGT TTCTGTTCAA AATGAAGCGT TAGACATTTT GAGTGCAACA 
CAAGCGGCGA ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC AAAAAATATT 
GATAATAAAA CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA AGGCACAACC 
AATGCTCAAA TCGATTTGAA TTCTATTACC GTGAAAGGCG TGCCAGAAGA TGCTTATTCA 
TTAGAGAAGA CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT GACAGAAAAC 
ATTACGATTG AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC AGAAACAACA 
ATCGACTCTG AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC CACTGCGCCA 
ATCACATTGA AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC AACTGCCACA 
TTCTACACGC ATAACGTAGA GGATGAAAAC CAAGCAATTG CGAAGGTTTC TTTTGAACTA 
ATTGATAATG TCACGCATAC AGCAACCGAA TTTACAACAG ATGAAAAAGG TCAATACTCC 
TTTGATGCCA TCATGACAGG TGATTATACT TTGCGAGTAA CGAATGTACC GCAGGAATAT 
TCCGTGGATG AAGAGTATTT GACAGGAAAA GCCATTAAGC TGGTCAAAGG AGACAACCAA 
CTAAAAATTC CATTAACGAA AACAATTGAT CACAGTCGTT TACAAGTCAA AGATTCAACG 
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ATTTATGTCG GCGATTCATG GAAAC C AGAA GAGAACTTTG TTTCAGCAAC AGATAAAACA 
GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA 
GGCGTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA TGTGACCGTC 
AAACCCGACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG TGATTCGTGG 
AAACCAGAAG ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA 
AAAATTGATG TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT 
AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG 
GTTAAAGATA CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA 
GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT 
GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA 
GCTAATGTGA CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT 
GTGGGCGATA AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA 
GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATGTTGATAA AATAGGCGAT 
TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT 
GACAGTCAGT TAGAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG GGAAGCAGAA 
GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT 
GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC 
ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT 
ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACAGAT 
AAAACAGGTC AAGATGTCCC ATTC 

EF104-4 (SEQ ID NO:404) 

VTTTEAQTE TTDTAKKEAE LSNSTPSLPL 

ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA LPDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT VQLPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL IKIETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNMNQQ KAYSLSINRT 
IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN VKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV SDQYIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL RATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT IPITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY IIEYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT KTTVTTKNID 
NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL RVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE NFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD TTIYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSGQVD TSKAGVYPIV YSYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG TVNVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP EXNFVSATDK 
TGQDVPF 

EF105-1 (SEQ ID NO:405) 



TAAATGAAAA AAACAGTCGT CTACTCCTTG TTATTCGGAA CAATGTTGCT 
GTTCCTGCTG AAGCGGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT 
AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA 



TGGCGCCACT 
TTTTACCCCA 
ACCAGTTCGA 
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CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC 
GATTATGCCT CAAGTTTGGA TTTTGGGAGT AATGAGATAT CGAATAAGGA TCAAACGTAT 
TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT 
AATTATGTAC AAGTAAGTGA TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA 
CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG CGCCACCGTC 
GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCGACGGACG TATTGCCGCC AACTGCTACC 
GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAAGC CCCAGAAAAG 
ACCGGCGCCG GAACGTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT 
CAACAAGGAC AGCAAGTAAA TGCCACAATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG 
AAAACCCCTA AGGATGCAGT ACAATATAAA ACAACATTGA CTTGGCTACT TTCAGATGTA 
CCAGTAAATA ATGGAGGGAA ATAA 

EF105-2 (SEQ ID NO: 406) 

MKKTWYSLL FGTMLLGATV PAEAATWFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP 
VDPTNPDGPN PGTPGPLSID YASSLDFGSN EISNKDQTYF ARAQTYRNPD GSASELATAN 
YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL HKELTGATVA FTEPSVRSNA TDVLPPTATA 
NIQLDAAGAE TWMQAPEKT GAGTWITLWG QAEKVTEKNQ QGQQVNATIT RAISLTVPGK 
TPKDAVQYKT TLTWLLSDVP VNNGGK 

EF105-3 (SEQ ID NO:407) 

GGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT TTTTACCCCA 

AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTCGA 
CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC 
GATTATGCCT CAAGTTTGGA TTTTGGGAGT AATGAGATAT CGAATAAGGA TCAAACGTAT 
TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT 
AATTATGTAC AAGTAAGTGA TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA 
CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG CGCCACCGTC 
GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCGACGGACG TATTGCCGCC AACTGCTACC 
GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAAGC CCCAGAAAAG 
ACCGGCGCCG GAACGTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT 
CAACAAGGAC AGCAAGTAAA TGCCACAATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG 
AAAACCCCTA AGGATGCAGT . AC 

EF105-4 (SEQ ID NO:408) 

ATWFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP 

VDPTNPDGPN PGTPGPLSID YASSLDFGSN EISNKDQTYF ARAQTYRNPD GSASELATAN 
YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL HKELTGATVA FTEPSVRSNA TDVLPPTATA 
NIQLDAAGAE TWMQAPEKT GAGTWITLWG QAEKVTEKNQ QGQQVNATIT RAISLTVPGK 
TPKDAV 

EF106-1 (SEQ ID NO : 409 ) 

TAGTCGTTTA TGAAGAAAAA AATCGTTGGT ACAATTACGT TGTTGGCTTT AAGTGCGTTA 
TTAGTTGGTG GAGCAGGAGG GGCTTTGACG GCAGAAGCAT ACGTTCCTCA AAGCGTAGAC 
AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT 
GAAGGATTAT CAGAAAAAGA TTGGGCTGGA ACACGCGTTT ATGATCGAAA TGGGAATGAC 
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC 
TATGAATTTT TTGATAAAGA GACTGGAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA 
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA 
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT 
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TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA 
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT 
ATTGGTAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGCAC AAAAGTAGTG 
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT 
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC 
TATGGCTACT TTAAAGTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC 
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA 
CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC 
CAAGGAACTT TTAATCCTGA ATTTACCTTT TAA 



EF106-2 (SEQ ID NO:410) 



MKKKIVGT ITLLALSALL VGGAGGALTA EAYVPQSVDN PNNLGDLPEY LRSVGIRQDE 
GLSEKDWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT 
AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY 
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDEII GKTSFDGTPQ LLWNGTKWD 
KDGNDVTSAN QNFISLAKFD QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN 
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GTFNPEFTF 

EF106-3 (SEQ ID NO:411) 



AT ACGTTCCTCA AAGCGTAGAC 
AATCCCAATA ATTTAGGGGA TTTACCTGAG 
GAAGGATTAT CAGAAAAAGA TTGGGCTGGA 
TTAACAGATG AAAATCAAAA CCTATTACAT 
TATGAATTTT TTGATAAAGA GACTGGAGAA 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT 
GGTGTATACC CACTTAGAAC TTTATACCAA 
GATAAAAACG GAAATGATAT TGAAGTTTTC 
TATGGTCGTC CGCAGCCATA CCCCAATAAT 
CGCCGTGCCA TGACAGAACA AACAGGCCAG 
ATTGGTAAAA CTTCCTTTGA TGGGACACCG 
GATAAAGATG GCAATGACGT AACTTCGGCC 
GACCAAGATA GCAGCAAATA TGAATTTTTC 
TATGGCTACT TTAAAGTAGG AAATCAAAAT 
AATCGCTATG GCGCTGTCTT AGAGTTAACA 
CGAATGGGTA AAGATAACGA AGGAAACGAT 
CAAGGAACTT 



TATTTACGTT CAGTTGGTAT TAGACAAGAT 
ACACGCGTTT ATGATCGAAA TGGGAATGAC 
GCAATCAAAT TTGATGCAAC CACTAGTTTC 
TCAACAGGAG ATGAAGGAAC CTTCTTTATG 
GTAATTATTT CTGAAACCAA AAATTATCAA 
GATACTTTTA CGTATAGACA GATGGGGAAA 
GTAGAAAACA AAGCAACCTC AGGACCAGTT 
CGTCCCAGAA CACTAGAATT CACGAATGGA 
ATTGATGTAA ATCGACAAGG GGATGAAATT 
CAACTTCTTT GGAATGGCAC AAAAGTAGTG 
AACCAAAACT TTATCAGCTT AGCGAAATTT 
AATTTACAAA CTGGTGAAAC TCGTGGCGAC 
AAATTCCGTG CCCATGTTTC CATTGGAACC 
GAATTGAATG ATAATCGTTT TACGTACACA 
ATCCAAGTCT ATGTGGAACA TGAACCATAC 



EF106-4 (SEQ ID NO:412) 



YVPQSVDN PNNLGDLPEY LRSVGIRQDE 
GLSEKDWAGT RVYDRNGNDL TDENQNLLHA 
AGITDVSRLV IISETKNYQG VYPLRTLYQD 
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI 
KDGNDVTSAN- QNFISLAKFD QDSSKYEFFN 
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI 



IKFDATTSFY EFFDKETGES TGDEGTFFMT 
TFTYRQMGKD KNGNDIEVFV ENKATSGPVY 
DVNRQGDEII GKTSFDGTPQ LLWNGTKWD 
LQTGETRGDY GYFKVGNQNK FRAHVSIGTN 
QVYVEHEPYQ GT 



EF107-1 (SEQ ID NO:413) 



TAAAAAACGG CACTCAATAT 
ATANATANAA AAATGCTAGT 
CTTTATAGAG ACTATAGATT 
AATTGGAAAA GATGGCTAGT 



GTCAAAATTT GAAATTTCAA 
TATCAGTATC GATAATAACA 
GAATTTTTAC ATAGAAAGAA 
TGTTGGGTTA AGTTGTTCTT 



GCTGTGTGTT CTTTGGTAAA 
GGATACTGAT TAAGAAAGGA 
GGAGCAAGAT GAAGCGAGTA 
TGTTCATGGA TTCAGTGGTT 
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GGTGTGACTG TGTTAGCGGA AACGATTACT GGGGCGACGG AGCAAGGAGT AGCAACATCT 
CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG 
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGCCTGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT 
GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA CGG AATTAGA GTTAACCAAT 
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT 
ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA 
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC 
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGGGA AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA 
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG 
AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT 
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT 
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC 
AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 
GTGTCCATCA CGTtfGGATCA GCCTTTACCA GCTGGCGGTC AATTAAAAAT GAACTTATTA 
GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC 
AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT 
ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA 
ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC 
ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG 
CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA 
ACGGCCGCTG CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT 
GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC 
CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGCTAA CATCAAACTA 
GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT 
TTAGTGACTG GCCCCTAA 



EF107-2 (SEQ ID NO:414) 
MKRVN 

WKRWLWGLS CSLFMDSWG VTVLAETITG 
SEAKTVPPQE TARIASRAIG YSSVEGREIP 



ATEQGVATSQ SSDEASQTTQ TTEESQATVA 
FFFVEEDGTL FDPDRITMAV NLSTFSFYEE 
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KLQRTPLEPT TVNGGKLLS I PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS 
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTNE KMPNKLTFGP KKTFKYTVAT 
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFENPAG GAIPAPEGYT 
QDKKTIVTGE DFTFTQEGTL PERYTGSDGK TYLFKGWYKG NAKPSTLETT KTPSYAVTYD 
DNDDLHWYE EAVMKTYTLP AREALFGYVD EQGNLINPAK FKLSATMGES DGATGEMTTF 
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYQTI SISPTTAYTG 
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS 
IYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFTQGN QLWDSENYV 
YTVAKALPKI YQAGEKTYIF QGWFKGKTKP ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP 
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIKIK PTATWAAGIG APNTIFVQGT 
GQNTKAFPVT KEQWTTGAGV SITLDQPLPA GGQLKMNLLG TAVTGNPGQV LTADVEVTGN 
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQQYGLK KAADYYGNGT 
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAAS FTDYN QPTETRTPLG 
KTSTVTLTAD NTATAWANQ QFTGSDVYQL DFTFANIKLE VPANQGMAGQ QYQAAVTWNL 
VTGP 

EF107-3 (SEQ ID NO:415) 
GG AGCAAGGAGT AGCAACATCT 

CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG 
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATG AAAGCCTGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT 
GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT 
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT 
ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA 
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC 
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGGGA AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA 
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG 
AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT 
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT 
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC 
AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 
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GTGTCCATCA CGTTGGATCA GCCTTTACCA GCTGGCGGTC AATTAAAAAT GAACTTATTA 
GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC 
AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT 
ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA 
ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC 
ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG 
CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA 
ACGGCCGCTG CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT 
GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC 
CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGCTAA CATCAAACTA 
GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT 
TTAGTGACTG GCCCCT 

EF107-4 (SEQ ID NO:416) 

EQGVATSQ SSDEASQTTQ TTEESQATVA 

SEAKTVPPQE TARIASRAIG YSSVEGREIP FFFVEEDGTL FDPDRITMAV NLSTFSFYEE 
KLQRTPLEPT TVNGGKLLS I PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS 
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTNE KMPNKLTFGP KKTFKYTVAT 
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFENPAG GAIPAPEGYT 
QDKKTIVTGE DFTFTQEGTL PERYTGSDGK TYLFKGWYKG NAKPSTLETT KTPSYAVTYD 
DNDDLHWYE EAVMKTYTLP AREALFGYVD EQGNLINPAK FKLSATMGES DGATGEMTTF 
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYQTI SISPTTAYTG 
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS 
IYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFTQGN QLWDSENYV 
YTVAKALPKI YQAGEKTYIF QGWFKGKTKP ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP 
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIKIK PTATWAAGIG APNTIFVQGT 
GQNTKAFPVT KEQWTTGAGV SITLDQPLPA GGQLKMNLLG TAVTGNPGQV LTADVEVTGN 
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQQYGLK KAADYYGNGT 
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAASFTDYN QPTETRTPLG 
KTSTVTLTAD NTATAWANQ QFTGSDVYQL DFTFANIKLE VPANQGMAGQ QYQAAVTWNL 
VTGP 

EF108-1 (SEQ ID NO: 417} 

TAATCGGTTT GGCGGGAATC GTACATAGAA AGAAGGGACG . AC ATGAAGCA AACTAAGTGG 
CAACGATTAG CAACCATTGG CTTGTGTAGT TCTTTAGTAA TTAACGCCTT TTCTGGTGTG 
ACGGCAGTTG CGGAAACCGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG 
GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG GAAAACCAAA 
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ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT 
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG 
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA 
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC 
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC 
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC 
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA 
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC 
TTAAGAAC CG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 
AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA 
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT 
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA 
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAAAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT 
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AGGTCCTTAA 

EF108-2 (SEQ ID NO:418) 

MKQTKWQ RLATIGLCSS LVINAFSGVT AVAETVTIES SPTAESSAKE 
ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI IPKPKKINMK 
ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM VQGSASTVGN 
LKNVIMPATS WMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGKID STYPLPAIAL 
AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVINSDPYT FQQSGFLPET 
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YKVGTKSYRF KGWYKGKTKT EPLATTKTPS YKVTYDDNDD LTWYEEFSG YELPASTNQF 
GFVDEATNKL IAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV PVNYFEQYRV 
NTFYGASDIT FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA 
PSYQVTYDDN DDLNWYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK YYAQSTSAYL 
RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF NVDKLAIDQQ 
LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEEIP TASVTLTRPK EVIDTNTNVI WTTTITNTSK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKKVS VAFTTRATGK 
PNTVLKAEW VFGGIKDSTV DNFVRIRPND QEWTPTTEG FISVPTFDFG QVGVAGTKQQ 
HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL LLGAAPVSSF 
TWYNQPTELK NTVGTTSAIS LTANNTATSI IANKQFTGSN VYQLDFTFNN VKLEVPANQG 
VKGQQYKAAV TWNLVTGP 

EF108-3 (SEQ ID NO:419) 

CGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG 

GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG GAAAACCAAA 
ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT 
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG 
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA 
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC 
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC 
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC 
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA 
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC 
TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 
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AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA 
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT 
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA 
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAAAAGC TTTACCAGCG ACGTAT AC T A CAGGTGGCAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT 
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AG 

EF108-4 (SEQ ID NO:420) 

VTIES SPTAESSAKE 

ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI IPKPKKINMK 
ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM VQGSASTVGN 
LKNVIMPATS WMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGKID STYPLPAIAL 
AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVINSDPYT FQQSGFLPET 
YKVGTKSYRF KGWYKGKTKT EPLATTKTPS YKVTYDDNDD LTWYEEFSG YELPASTNQF 
GFVDEATNKL IAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV PVNYFEQYRV 
NTFYGASDIT FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA 
PSYQVTYDDN DDLNWYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK YYAQSTSAYL 
RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF NVDKLAIDQQ 
LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEEIP TASVTLTRPK EVIDTNTNVI WTTTITNTSK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKKVS VAFTTRATGK 
PNTVLKAEW VFGGIKDSTV DNFVRIRPND QEWTPTTEG FISVPTFDFG QVGVAGTKQQ 
HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL LLGAAPVSSF 
TNYNQPTELK NTVGTTSAIS LTANNTATSI IANKQFTGSN VYQLDFTFNN VKLEVPANQG 
VKGQQYKAAV TWNLVT 



EF109-1 (SEQ ID NO:421) 
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AGGAGTAAAT TAATGAAAAA AAGTGTTATA ACTAGTTCTA TGTTAGCGGT TTTGTTGTCG 
GGATTTCTCG TTACCCCTAT TTCTGCTTAC GCTTTGGAAC GCTCTAAGGG AACTACTGAA 
GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG TGTCACTGAA 
GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG AAACAACGTC CACTTCCTCA 
ACCTCCGATT CCACCACTGA AGTTTCTACA TCAGAAGTAA CAACTGTTAA TGATACAGAA 
NATAGTAGCG ACGTACTGAA ACTACTTTGG NAACATCACN AAGTAATGAG GACACACCTA 
TAG 



EF109-2 (SEQ ID NO:422) 

MKKSVI TSSMLAVLLS GFLVTPISAY ALERSKGTTE ETVASETSLT ERQMSSGVTE 
EMNPSIINSQ EETETTSTSS TSDSTTEVST SEVTTVNDTE XSSDVLKLLW XHHXVMRTHL 

EF109-3 (SEQ ID NO:423) 

GGAAC GCTCTAAGGG AACTACTGAA 

GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG TGTCACTGAA 
GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG AAACAACGTC CACTTCCTCA 
ACCTCCGATT CCACCACTGA AGTTTCTACA TCAG 

EF109-4 (SEQ ID NO:424) 

ERSKGTTE ETVASETSLT ERQMSSGVTE EMNPSIINSQ EETETTSTSS TSDSTTEVST S 

EF110-1 (SEQ ID NO:425) 

TAAATAAAAA TGGATAAGGA GTGGCATAAT 
AGTGCTGGTT TTTTG'JTTCT GATTTTAGTA 
AATGCAGAAG AGTATATCGT TCCTGCCGAA 
GACCCTGAGG ACAGAAGACA AGAAGTGGCA 
GGAAGAATCA TTTCCCCTGC CAGTAAACCA 
GTTGGAACCA ATACAATTGT CACCAATAAT 
GTATTAAATC CGAATGCCAA AGATGATGCT 
ACACCATTTG GCAAATTCAA AGTGATTGAT 
GTAGTGACTG TCGGCAAACA AAACGATCGT 
ACGCCATTTG TTTTGAAAAA GTTTGAATCT 
CCAGGTGAGA AAAACCACAC ACAATGGTCT 
ACAGACTTAG AAAATCCATT ACTATTTTAT 
TCACCAATCT ATAATGATCA GGTTGAAGTA 
CAAACAGGAA ATCATGGTCA AAGACTAAAT 
GTGAATGAAG AAGAAAATAA ACGTTTATCC 

EF110-2 (SEQ ID NO:426) 

MKKFSIRKIS AGFLFLILVT LIAGFSLSAN 
PEDRRQEVAD TTEAPFASIG RIISPASKPG 
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV 
PFVLKKFESS DTHVTISGYP GEKNHTQWSH 
PIYNDQVEW GVHSNGGIKQ TGNHGQRLNE 

EF110-3 (SEQ ID NO:427) 

AG AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG 



CTTATGAAAA AGTTCTCCAT ACGAAAAATT 
ACTTTGATCG CCGGTTTTAG CTTGTCTGCA 
AGTCATTCAC GACAAAAAAG ATCGTTACTG 
GATACAACCG AAGCGCCTTT TGCGTCAATC 
GGCTATATTT CTTTAGGAAC AGGCTTTGTT 
CATGTGGCTG AAAGTTTTAA GAATGCCAAA 
TGGTTTTATC CAGGTCGAGA TGGCAGTGCG 
GTAGCTTTTT CCCCGAATGC GGATATTGCG 
CCAGATGGCC CAGAGTTGGG AGAAATTTTA 
TCAGATACCC ATGTCACAAT ATCAGGCTAT 
CATGAAAATG ATTTGTTTAC ATCTAACTTT 
GATATCGATA CAACCGGCGG TCAATCTGGT 
GTTGGTGTTC ATTCCAATGG CGGCATTAAG 
GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GCTGTGCCAG CAGCGTAA 



AEEYIVPAES HSRQKRSLLD 
YISLGTGFW GTNTIVTNNH VAESFKNAKV 
AFSPNADIAV VTVGKQNDRP DGPELGEILT 
ENDLFTSNFT DLENPLLFYD IDTTGGQSGS 
VNYNFIVNRV NEEENKRLSA VPAA 
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GACCCTGAGG ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC 
GGAAGAATCA TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT 
GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA 
GTATTAAATC CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG 
ACACCATTTG GCAAATTCAA AGTGATTGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG 
GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA 
ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT 
CCAGGTGAGA AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT 
ACAGACTTAG AAAATCCATT AC T ATTTT AT GATATCGATA CAACCGGCGG TCAATCTGGT 
TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG 
CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGC C AG CAGCGT 



EF110-4 (SEQ ID NO: 428) 

EYIVPAES HSRQKRSLLD 
PEDRRQEVAD TTEAPFASIG RIISPASKPG 
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV 
PFVLKKFESS DTHVTISGYP GEKNHTQWSH 
PIYNDQVEW GVHSNGGIKQ TGNHGQRLNE 

EF111-1 (SEQ ID NO:429) 



YISLGTGFW GTNTIVTNNH VAESFKNAKV 
AFSPNADIAV VTVGKQNDRP DGPELGEILT 
ENDLFTSNFT DLENPLLFYD IDTTGGQSGS 
VNYNFIVNRV NEEENKRLSA VPAA 



TGATCAATAC ACTTCGATAC GGTCGCTTTT TTTCTAGAGA AAGTTGAATC TTTCAATAAT 
AAAAAGGGAT ACACTCCATT TGGCATAGTC CTTGCTGATA ATAAATCAGT GTATAAAGCG 
CTATCATTTT ATAGGAGGGG TTTTATGAAG GGTTTATCAA AAAAGAAACG GGTGTCTACT 
TGGTTAGCGT TAGGAATCAC CGTAGTCAGC TGTTTTGCGT TAAGCAGGGA AGTGCAAGCA 
AGTGTTGAAA GAACAAAAGT TGATGAATTT GCAAATGTTT TAGATGTGAG TGCATCACCA 
AC CG AACGGA CGAATGGCGT ATACGATACC AATTATTTTA ATAATTTTTC TGATTTAGGT 
GCATGGCATG GCTACTATTT ACCTGAAAAA AGCAATAAAG AGCTACTGGG TGGTTTTGCG 
GGGCCATTGA TTATTGCGGA AGAATATCCA GTAAACTTGG CGGCAAGTTT AAACAAATTA 
ACGGTCAAAA ATAAAAAAAC GGGAGAAACC TATGATTTAA GCCAAAGCAA CCGCATGGAC 
CTGTCTTATT ATCCTGGGCG CCTAGAGCAA ACCTATGAAT TAGACGATTT AACGATTCAT 
TTAGCTTTAA TTTTTGTCAG CAATCGAACG GCGCTTATCC AAACGACACT TGAAAACACT 
GGTGAAGAGC CCTTGTCACT TGGAGCAAGC TGGACAGGTG CGGTCTTTGA CAAAATTCAA 
GAGGGAACGG AAAC CTTAGA TATTGGCACT CGTTTAACTG CTAAAGACAA TGACATTCAA 
GTGAATTTTG GTGAAGTCAG AGAAACGTGG AATTATTTTG CTACGAAAGA CACAAAATAT 
ACGATTCATC ATGCGGATAA AGTTTCAACA AAAATTGATA ATCGGAATTA TACAGCAACC 
GCTGAACCAA TTGAATTGAA GCCTAAACAA ACGTACAACA CCTATACGAC AGAAAGCTAT 
ACTTTTACAA AAGAAGAAGA GGCAAAGGAA CAACAACAAG CACCCGAATA TACCAAAAAT 
GCGGCGCGCT ATTTCAAAGA GAACAAGCAA AGATGGCAAG GATATCTAGA TAAAACGTTT 
GATCAAAAGA AAACAGCAGA ATTTCCTGAA TATCAAAATG CGCTAGTCAA ATCGATTGAA 
ACGATTAATA CCAATTGGCG AAGTGCGGCA GGTGCCTTTA AGCATGACGG GATTGTTCCG 
TCCATGTCTT ATAAATGGTT TATTGGTATG TGGGCTTGGG ATTCGTGGAA AGCGGATGTA 
GCAACGGCTG ATTTTAATCC TGAGTTAGCT AAAAATAATA TGCGGGCCTT GTTTGATTAT 
CAAATTCAAA AAGATGATAC CGTACGTCCA CAAGATGCAG GAGCGATCAT TGATGCTGTC 
TTTTACAATC AAGACAGTGC GCGTGGTGGT GAAGGTGGCA ACTGGAATGA ACGAAATTCT 
AAACCACCAT TGGCTGCATG GGCAGTTTGG CATATTTATC AAGAAACCAA AGATAAGGAA 
TTTTTAAAAG AAATGTATCC CAAACTTGTG GCTTATCATA ATTGGTGGTA TACCAACAGA 
GACCACAATA AAAATGGGAT AGCAGAATAT GGAAGCATGG TCAGTGATGC TCACTGGCAA 
AAAGACGACA AGGATCAAAT CATTAAAGAT AAAAATGGCC ACCTAAAGTG GATGATGATG 
CTGTTATTGA AGCAGCCGCG TGGGAAAGTG GCATGGATAA CGCTACACGG TTTGACAAAG 
AAGGTGTGGG CAAAGGCGAC GTTGGAGTTA AAGTTTTTGA AAACAAAAAT AAAGGAAAAG 
TAG 
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EFlll-2 (SEQ ID NO:43< 

MKG LSKKKRVSTW 
LALGITWSC FALSREVQAS 
WHGYYLPEKS NKELLGGFAG 
SYYPGRLEQT YELDDLTIHL 
GTETLDIGTR LTAKDNDIQV 
EPIELKPKQT YNTYTTESYT 
QKKTAEFPEY QNALVKSIET 
TADFNPELAK NNMRALFDYQ 
PPLAAWAVWH IYQETKDKEF 
DDKDQIIKDK NGHLKWMMML 

EF111-3 (SEQ ID NO:43! 

TGATGAATTT GCAAATGTTT 
AC CG AACGGA CGAATGGCGT 
GCATGGCATG GCTACTATTT 
GGGCCATTGA TTATTGCGGA 
ACGGTCAAAA ATAAAAAAAC 
CTGTCTTATT ATCCTGGGCG 
TTAGCTTTAA TTTTTGTCAG 
GGTGAAGAGC CCTTGTCACT 
GAGGGAACGG AAACCTTAGA 
GTGAATTOTG GTGAAGTCAG 
ACGATTCATC ATGCGGATAA 
GCTGAACCAA TTGAATTGAA 
ACTTTTACAA AAGAAGAAGA 
GCGGCGCGCT ATTTCAAAGA 
GATCAAAAGA AAACAGCAGA 
ACGATTAATA CCAATTGGCG 
TCCATGTCTT ATAAATGGTT 
GCAACGGCTG ATTTTAATCC 
CAAATTCAAA AAGATGATAC 
TTTTACAATC AAGACAGTGC 
AAACCACCAT TGGCTGCATG 
TTTTTAAAAG AAATGTATCC 
GACCACAATA AAAATGGGAT 
AAAGACGACA AGGATCAAAT 
CTGTTATTGA AGCAGCCGCG 
AAGGTGTGGG CAAAGGCGAC 



) 



VERTKVDEFA NVLDVSASPT 
PLIIAEEYPV NLAASLNKLT 
ALIFVSNRTA LIQTTLENTG 
NFGEVRETWN YFATKDTKYT 
FTKEEEAKEQ QQAPEYTKNA 
INTNWRSAAG AFKHDGIVPS 
IQKDDTVRPQ DAGAIIDAVF 
LKEMYPKLVA YHNWWYTNRD 
LLKQPRGKVA WITLHGLTKK 

-) 

TAGATGTGAG TGCATCACCA 
ATACGATACC AATTATTTTA 
ACCTGAAAAA AGCAATAAAG 
AGAATATCCA GTAAACTTGG 
GGGAGAAACC TATGATTTAA 
CCTAGAGCAA ACCTATGAAT 
CAATCGAACG GCGCTTATCC 
TGGAGCAAGC TGGACAGGTG 
TATTGGCACT CGTTTAACTG 
AGAAACGTGG AATTATTTTG 
AGTTTCAACA AAAATTGATA 
GCCTAAACAA ACGTACAACA 
GGCAAAGGAA CAACAACAAG 
GAACAAGCAA AGATGGCAAG 
ATTTCCTGAA TATCAAAATG 
AAGTGCGGCA GGTGCCTTTA 
TATTGGTATG TGGGCTTGGG 
TGAGTTAGCT AAAAATAATA 
CGTACGTCCA CAAGATGCAG 
GCGTGGTGGT GAAGGTGGCA 
GGCAGTTTGG CATATTTATC 
CAAACTTGTG GCTTATCATA 
AGCAGAATAT GGAAGCATGG 
CATTAAAGAT AAAAATGGCC 
TGGGAAAGTG GCATGGATAA 
GTTGGAGTTA AAGTT 



ERTNGVYDTN YFNNFSDLGA 
VKNKKTGETY DLSQSNRMDL 
EEPLSLGASW TGAVFDKIQE 
IHHADKVSTK IDNRNYTATA 
ARYFKENKQR WQGYLDKTFD 
MSYKWFIGMW AWDSWKADVA 
YNQDSARGGE GGNWNERNSK 
HNKNGIAEYG SMVSDAHWQK 
VWAKATLELK FLKTKIKEK 



ATAATTTTTC TGATTTAGGT 
AGCTACTGGG TGGTTTTGCG 
CGGCAAGTTT AAACAAATTA 
GCCAAAGCAA CCGCATGGAC 
TAGACGATTT AACGATTCAT 
AAACGACACT TGAAAACACT 
CGGTCTTTGA CAAAATTCAA 
CTAAAGACAA TGACATTCAA 
CTACGAAAGA CACAAAATAT 
ATCGGAATTA TACAGCAACC 
CCTATACGAC AGAAAGCTAT 
CACCCGAATA TACCAAAAAT 
GATATCTAGA TAAAACGTTT 
CGCTAGTCAA ATCGATTGAA 
AGCATGACGG GATTGTTCCG 
ATTCGTGGAA AGCGGATGTA 
TGCGGGCCTT GTTTGATTAT 
GAGCGATCAT TGATGCTGTC 
ACTGGAATGA ACGAAATTCT 
AAGAAACCAA AGATAAGGAA 
ATTGGTGGTA TACCAACAGA 
TCAGTGATGC TCACTGGCAA 
ACCTAAAGTG GATGATGATG 
CGCTACACGG TTTGACAAAG 



EF111-4 (SEQ ID NO:432) 

DEFA NVLDVSASPT ERTNGVYDTN YFNNFSDLGA 

WHGYYLPEKS NKELLGGFAG PLIIAEEYPV NLAASLNKLT VKNKKTGETY DLSQSNRMDL 
SYYPGRLEQT YELDDLTIHL ALIFVSNRTA LIQTTLENTG EEPLSLGASW TGAVFDKIQE 
GTETLDIGTR LTAKDNDIQV NFGEVRETWN YFATKDTKYT IHHADKVSTK IDNRNYTATA 
EPIELKPKQT YNTYTTESYT FTKEEEAKEQ QQAPEYTKNA ARYFKENKQR WQGYLDKTFD 
QKKTAEFPEY QNALVKSIET INTNWRSAAG AFKHDGIVPS MSYKWFIGMW AWDSWKADVA 
TADFNPELAK NNMRALFDYQ IQKDDTVRPQ DAGAIIDAVF YNQDSARGGE GGNWNERNSK 
PPLAAWAVWH IYQETKDKEF LKEMYPKLVA YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK 
DDKDQIIKDK NGHLKWMMML LLKQPRGKVA WITLHGLTKK VWAKATLELK 
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EF117-1 (SEQ ID NO:433) 

TAATTCGATG GAGAAGGTGG TTTAGTGAAA AGATTTTCAT TTTTTTTACT AATTTTACTT 

GCTTTAACAG GTTGTAAATC CGGTGAAAAA GAATTTGATG AAGAATCTCT TCAAAATCTA 

AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT TCGTTTAAAT 

GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG TTTAATAAAA 

AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN TAATGAGCAA 

AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA CGGCTTTTTG 

AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAATTA G 

EF117-2 (SEQ ID NO:434) 

VKR FSFFLLILLA LTGCKSGEKE FDEESLQNLK ETXQSXSETE LQNGDVRLNE * 
YISLKGEIVE SDSRSSLIKK GDRFILKSGS SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK 
GTLIESEENH DSATN 

EF117-3 (SEQ ID NO:435) 

TG AAGAATCTCT TCAAAATCTA 

AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT TCGTTTAAAT 
GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG TTTAATAAAA 
AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN TAATGAGCAA 
AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA CGGCTTTTTG 
AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAA 

EF117-4 (SEQ ID NO:436) 

EESLQNLK ETXQSXSETE LQNGDVRLNE YISLKGEIVE SDSRSSLIKK GDRFILKSGS 
SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK GTLIESEENH DSATN 



EF118-1 (SEQ ID NO:437) 

TGAGGGGGAA AAAGTGTGTT AAAAAGAAAA 
GCTTTGTTAC TGACAGGTTG TGGCAAAAGT 
AATCANAACG CACAAGAATC TGGGGTTTGN 
TTTTCACAAG AAGATGGTGC ACAAACGAAT 
AAAGACGCAT CGCTTTCTGG GGAAGATTCA 
CTTAGAGATG AAATTAAAAG CGATGGGAAT 
AGATAA 

EF118-2 (SEQ ID NO:438) 



GTGGGGATTG TCGCAGGCGT TTTCTGTTCA 
GCGAAAGATG AGTTCATTCA AGGAATCGGC 
GATTTCTCTA TGTCAATTAG TGACATGAAA 
CCTATGATTG GGATGCTCAT CACGCAAATC 
AGTAGATGCC AAAAAAGAAA AAGCATTCAA 
GGATGTACCG ATTTCATTGG TTGGATCGTT 



VLKRKV GIVAGVFCSA LLLTGCGKSA KDEFIQGIGN XNAQESGVXD FSMSISDMKF 
SQEDGAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR 

EF118-3 (SEQ ID NO: 439) 

GAAAGATG AGTTCATTCA AGGAATCGGC 

AATCANAACG CACAAGAATC TGGGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA 
TTTTCACAAG AAGATGGTGC ACAAACGAAT CCTATGATTG GGATGCTCAT CACGCAAATC 
AAAGACGCAT CGCTTTCTGG GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA 
CTTAGAGATG AAATTAAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT 
AGAT 
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EF118-4 (SEQ ID NO:440) 

KDEFIQGIGN XNAQESGVXD FSMSISDMKF SQEDGAQTNP MIGMLITQIK DASLSGEDSS 
RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR 

EF119-1 (SEQ ID NO:441) 

TAAAGAATAC CGAGTAAAAT TTTCGGAAGG CTTTTTTTCA AAAATTGTAT ATGCAAAAGA 
AGTGCAACGG AAAGGAGCTC GGAAATCGTG AATAAGCTAC CTTTACTTAT TTTATTGTTA 
GGCGGAGTGT TGCTTGTTAG TGGCTGTCAA AGCCATAAGG AAGAAAACAA GTCTAGTAAA 
GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA GGGAACAATC GAAGGAATCG 
TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA AATTAGAAGA ACCAGATCAT 
GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT TTACAAGTAT TAATGATCGC 
AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA AAAAAAATGG AATTGATGTT 
AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA CGATTTATAA AAATGATCAA 
CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 
TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT ATAATTCAGT TAAGCAAGAG 
TATTAG 



EF119-2 (SEQ ID NO:442) 

VN KLPLLILLLG GVLLVSGCQS HKEENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE 
TTKLEEPDHV KLLEAYGNAY ANFTSINDRN EKLKPLMTEK CIKKNGIDVK TGVALVSVGK 
VTTIYKNDQH EYALLLDCEQ NGTQTRVLLL AKVKNNKISE MTYNSVKQEY 

EF119-3 (SEQ ID, NO:443) 

AGAAAACAA GTCTAGTAAA 

GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA GGGAACAATC GAAGGAATCG 
TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA AATTAGAAGA ACCAGATCAT 
GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT TTACAAGTAT TAATGATCGC 
AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA AAAAAAATGG AATTGATGTT 
AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA CGATTTATAA AAATGATCAA 
CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 
TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT ATAATTCAGT TAAGCAAGAG 
TAT 

EF119-4 (SEQ ID NO:444) 

ENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE TTKLEEPDHV KLLEAYGNAY 
ANFTSINDRN 

EKLKPLMTEK CIKKNGIDVK TGVALVSVGK VTTIYKNDQH EYALLLDCEQ NGTQTRVLLL 
AKVKNNKISE MTYNSVKQEY 

EF120-1 (SEQ ID NO:445) 

TGAATAGGCG TGAAAAAGGG AATGTTAGCG TTTTTTGTCG TGCTAGCGGT TTTATCATTA 
ACTGCTTGTC GGGAACCAAA AGNAAAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA 
GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT 
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT 
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ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA 
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT 

EF120-2 (SEQ ID NO: 446) 

VKKGMLAF FWLAVLSLT ACREPKXKKV TASTEASSKV EETNEKTSET IDKTNEQASS 
SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 

EF120-3 (SEQ ID NO:447) 

AAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA 

GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT 
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT 
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA 
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT 

EF120-4 (SEQ ID NO: 448) 

KKV TASTEASSKV EETNEKTSET IDKTNEQASS 

SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 



EF121-1 (SEQ ID NO: 449) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG TTGAGCTTTA AAAAAGTGAA GTGGGGCATG 
CATTTTTTAA TGGCTGTTGC GTTGATAGCG CCAAGTGTTA CTAGTACGGC ATATGCAGTA 
GAAACAACGA GTCAACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA 
CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATGATAA AGAACTACCA 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC 
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
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GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG "AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATC CGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
C C AGAAAC AG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA 
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT 
TAG 

EF121-2 (SEQ ID NO:450) 

VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQSSEAV TSTTDSSRKQ 
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT 
SDVHGQLWNW SYEDDKELPV GLSQVSTWN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEWKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
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PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTOLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL FVGLSSSGWY IYKRRNKAS 

EF121-3 (SEQ ID NO:451) 

ACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA 

CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTC TTATG AAGATGATAA AGAACTACCA 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGC CGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC 
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCG 



EF121-4 <SEQ ID NO: 452) 



QSSEAV TSTTDSSRKQ 

EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT 
SDVHGQLWNW SYEDDKELPV GLSQVSTWN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 
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KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEWKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN A 



EF122-1 (SEQ ID NO:453) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG 
CATTTTTTAA TGGCTGTTGC GTTGATAGCG 
GAAACAACGA GTCAACAAAG TTCAGAAGCA 
CAAGAACCAG TCATTACACA GGAAACAACA 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG 
ACGTCAGATG TTCATGGTCA ATTATGGAAT 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT 
ACCGTTTTAA TTGATAATGG CGACAATATT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC 
AAGTATGATG CAATGGTTTT GGGAAATC AT 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC 
GGTCTTCGTT TTGTTGAAGG GACTACCACG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT 
GAGTTGAAAG CTAACGATCA GGCTGACATT 
AATAGTGATC CGGCTGCCAG TGCCGACCAA 
TATATTCTGG GTCATGACCA CCTTTCTTTT 
ACTGTACCGG TAGGGGGACC GAAAGATACG 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA 
GTTCAAAAAG AAGTAACGGG CGCACAATTA 
AAATTACCTG CGGGGAAGAT TTCCTATGCC 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA 
GCGTACTATA ACCAAACACA GCCAGATGAT 
GTATATAACT ATGACATGAT TTCTGGAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC 
ACGATTGCTA TGAATAATTA TCGTTACGGC 
GAACCTATTA AAAATTCTGA TCCAGAAACC 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA 
ATTCCGACTT CTCCTGATGG ACGTACACCA 
CGTAATGCGG GCTTTGATTT AGATAATGCA 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA 
CAAGAAAACC CAACCTTGAT GGTGGATGCA 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT 
CTGAATTTTC CGATTTTATC TAGTAATACG 
GATCCGTATA CAATCGTAGA AAAATCCGGG 



TTGAGCTTTA AAAAAGTGAA GTGGGGCATG 
CCAAGTGTTA CTAGTACGGC ATATGCAGTA 
GTAACAAGTA CCACCGATTC AAGTAGAAAA 
GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACCACAGCAC CAACAGAGAC GACGAATTTA 
AGCACGCCGC AAAAAATAAC AATTTTAGGT 
TGGTCTTATG AAGATGATAA AGAACTACCA 
AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
CAAGGCACTA TTTTAACAGA TGACTTGTAT 
CATCCAATGA TCACCGCCAT GAATGTGATG 
GAGTTTAATT TTGGTTTACC GTTAATCAAA 
TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
AAGGAACTTG ATTTTAATCA AGATGGGCAG 
ACAATTCCGC ACATTCCTTT GTGGGATGGC 
TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
ATTGTTGCCT CGATTCATGC GGGACAACAA 
GTAATTGAAA ATGTCGCGGG GATTGATGCG 
ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
CAAGAAGGTA CAGCAACGAT TGTACCAACA 
GCAGCGACAA AAGAATACCA TGAAAAAACG 
GCAACAGCTG ATTTTTTACC AAAACAAGAA 
CCAACAGCGA TGATTTCTTT AATTAATAAC 
AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
ACGATTTTTG ATATCTACAA ATACCCGAAT 
AACTTACTGA AGTATTTAGA AAAACAAGGG 
TTGACCATTA GTTTTAATCC AAACATTCGT 
GACTACAAGA TTGACATTTC AAAACCAGTG 
GGCCAACCGC TGGATCCTGC CAAAGAATAT 
GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
TTACGAGGAA TGATTGTTGA TTATATTAAG 
ATCGAACGAA ATTGGTCAAT TATTGGGACA 
ATCGAATTAG TGAATGACGG CACTCTTCAA 
AACGCCGCCG CTATTACGAA ACAAGATGTC 
TATACCATTA TGCACACAAA TGACGTTCAT 
GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
GGGGATGTTT TCCAAGGATT ACCAATCTCC 
GCAATGAATG AAGTTGGTTA TGATGCCATG 
TTAGAGATTG CACTAGGTTA TAAAGACCAA 
TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
AAAAAGTTTG CCATTGTAGG TGTGACGACC 
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CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGC AGTT AAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TeGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA 
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT 
TAG 



EF122-2 (SEQ ID NO:454) 

VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQSSEAV TSTTDSSRKQ 
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT 
SDVHGQLWNW SYEDDKELPV GLSQVSTWN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEWKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL FVGLSSSGWY IYKRRNKAS 



EF122-3 (SEQ ID NO:455) 
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TG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TT 

EF122-4 (SEQ ID NO:456) 

EKWRAKAI ELVNDGTLQI 

PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGF 

EF123-1 (SEQ ID NO: 457) 



TAAAATAAAA AATTGGTACG AAGTGAACGT 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG 



TCTCTTCTAT GTGTCGTTAG 
GTAAACCGTT GGCTCTACGG 



TAGAGGAAGG 
GTTGATGTGT 
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TTGTTACTTG TTCTAAATTA TGGCACACCA 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG 
CTTCAAGGAA AAGCACAACC AGTAACACAA 
TCAATCAAAG CTGCACATTG GGCAGCGCCC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT 
ACGTTGAACC CTACAGCTAC AGAAGATGTG 
TTGACGTTAA AGACTGGTAC TGATCCGACA 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA 
ACAACGATTA TCGATAATTT TGAAGATCCG 
ACACCGCCGT ATAAAGAAGA TGTGACCATT 
GATGTGCGAG AACAAATGAA AGCAGGCGAT 
AAACCTAATA AACCAGGTTC AGGTGATTTA 
TACACAATTA GTGAAGATGG TACGGTTCGT 
AGTGACATTC ACGGGGACTT TTCTTTAGAT 
CCAGGAGATT GGGTGATTGA TATTCCTACA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC 
ACTGTGACGG AAACATGGCC AACAGGGAAT 
GTGATGAATC TTGATGGAAC AATTAAAGAA 
ACCGTTGATA AAAATGGCAA TGTGACGATT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT 
GTTACCGCCA CATATGGCAA AATGTTAGAC 
CAAGAATTCA CTTGGGAAAT TAACTACAAC 
GCAGTCATTA CAGACACAAT GGGGGATAAT 
TATTCAGTGA CATTTGATGA CAAAGGAAAT 
AAAGATTACA AAGTGGTAAT CAACGGAGAC 
GTGACTGGCG CAGTCAAGAT TGATTATAAA 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC 
GCCAGTCAAC AAAATATTAT TAAAAACACT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT 
TTCCACATAA CTTATACTAC CTTTTTCGAT 
TTGGACCATT ATCGAAATAC CGCTGCCATT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA 
AATAATCGTT TAGTCGACGC CTTTTTGACG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT 
AAACCAACGC AACCGTTGAC GGATATCACA 
ACTTGGCGTG TTGATTTTCC TAATGATAGT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT 
CAAGGTTCTT CACGTGATGT GACAGGAAAA 
AAAAAAGGTG GCGAATACCA CAAAGATGAT 
AATGGCGCCC AATCGGTTTT AGACGATGTG 
GTGCTAGATC CCGAGTCATT GGTGATTTAC 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA 
GATAATGAAA CAGGACAACA AAAAATTGTC 
TATATGGAAT ATCGTAGTTT AGTGACTTCT 
AACCAAGTGT CAATTACTGG AAATGGTTCA 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG 
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CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
GAGGTTGTAG TGCATTATAG TGCCAATGTG 
AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTTTCTT ATGGACAACA GCAACGAGCG 
GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
GCGATAGAAA GTAAAAC AAC TGAATCGACG 
GATATCAGTG ATTATTTTAC AGGTGATGAA 
ATTTATTTAA ATCCTGATGG AACACCAGCA 
CATTGGAACT TTAACTGGTC GATTCCAGAA 
TACTTCGAGT TTCAATTACC TGGCAATTTG 
GTTGATGCAG AAGGCAATGT CTATGGAACC 
TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
AAACAAGGCC ATTTTGATCG AACGCCCAAT 
AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGGGTCGCG AACTTAGTCC AGATGAATAT 
AAAGGTGACA CCAACAAAGC GTATCGTCTT 
ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AATCCAAATG GGTTAGATGC TGAAGCAACT 
AAGCGCAATA TAGATTACGA CGAAGCCAAT 
TATGGTGAAC AAACCATTCC AAAAGACCAA 
TTAACGTTTG AACCAGATTC TTTACATTTA 
GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
GGTTCCTTTG CAATTGACTT TTTACATGAT 
ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GGTGCAGTTG ATTATCAAAA TTCAACGATT 
TATTTGATGG AAAATGCCGT GATTACGGAT 
GTACCCAATT CGTTGGTTGT CAAAGATACA 
GATTTCATGG TAGAAATAAC TCGTAATGCA 
ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
GTTACCGAGT TAGACGCTAA CAATCCTGCG 
GATTGGACGG ATGAAGCAGG AAACAATCAT 
TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
ATCACTTGGA CGATTGCGGT TAATTTAAGT 
GATCCAATTT TAACCAATCA AACCTATTTG 
ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
CGTACGTATG TGATTGAATT TAAGACGTCT 
AGTTATGACA ATACCGCATC TTATACAAAC 
GTTTCTATTC AACATGGTGG CGAATCAGTG 
CCAGATCATG TGTACTGGCA TGTAATGATC 
GTTATTACTG ATACACCCTC ACCAAACCAA 
GGTACCAACG TAACAGAAGA CGGAACTATT 
GGAAAAGATT ACACACTGGA AGTTACCACC 
GTTAAAATGG CCCATATTGA AGCACCTTAT 
TCAGCGGCGG GGAGTACAGA CACGGTATCC 
GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
CATGCCACAG GGACTAAAGG CAAAATTCAG 
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TABLE 1 . Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes. 

CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC G AATGGC C AG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 

EF123-2 (SEQ ID NO:458) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE WVHYSANVS IKAAHWAAPN NTRKIQVDDQ 
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 
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TABLE L Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA 
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS G VYNAVTKE I TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
PDKSVILEEG KDYTLEVTTD NETGQQKIW KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
QVSITGNGSE WHGDDNGDV WDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATV I ATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKI IFRDLAP GTYYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WLKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 

EF123-3 (SEQ ID NO:459) 

GGAAGA GGTTAACAGC 

GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
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TABLE 1. Nucleotide and Amino Acid Seqeuences ofE.faecalis Genes. 



GTGATGAATC 
ACCGTTGATA 
GAGTACCAAA 
AAAAATCACG 
GTTACCGCCA 
CAAGAATTCA 
GCAGTCATTA 
TATTCAGTGA 
AAAGATTACA 
GTGACTGGCG 
GTTGCCGTGA 
GCCAGTCAAC 
GGTTGGACGT 
ACGTACGAAC 
ACCACTGGTG 
GATGGTGAAA 
TTCCACATAA 
TTGGACCATT 



TTGATGGAAC 
AAAATGGCAA 
CGACGATTGA 
CGACGTTAAC 
CATATGGCAA 
CTTGGGAAAT 
CAGACACAAT 
CATTTGATGA 
AAGTGGTAAT 
CAGTCAAGAT 
ATAATCGTGT 
AAAATATTAT 
TAGCTGTGAA 
CAGTTCCTGG 
CTCAGTTGAC 
CAGGCTTTAA 
CTTATACTAC 
ATCGAAATAC 



AATTAAAGAA 
TGTGACGATT 
CGAGGCGGTT 
AAGTGATAAT 
AATGTTAGAC 
TAACTACAAC 
GGGGGATAAT 
CAAAGGAAAT 
CAACGGAGAC 
TGATTATAAA 
GGATGTTGGC 
TAAAAACACT 
TCAAAATAAT 
CTTAACTATG 
GTTAGGCAAG 
GGTAAGTTTT 
CTTTTTCGAT 
CGCTGCCATT 



GTGGGTCGCG 
AAAGGTGACA 
ATTCCAGATG 
AATCCAAATG 
AAGCGCAATA 
TATGGTGAAC 
TTAACGTTTG 
GAAGTCGTTG 
GGTTCCTTTG 
ACCAAAGTTG 
ACTGGTCAGC 
GGTGCAGTTG 
TATTTGATGG 
GTACCCAATT 
GATTTCATGG 
ATAGGGGCGT 
GTTACCGAGT 
GATTGG 



AACTTAGTCC 
CCAACAAAGC 
GCGGCGGCGA 
GGTTAGATGC 
TAGATTACGA 
AAACCATTCC 
AACCAGATTC 
GAGCAGAACT 
CAATTGACTT 
ATGGAATTGT 
ATTCAGAAGA 
ATTATCAAAA 
AAAATGCCGT 
CGTTGGTTGT 
TAGAAATAAC 
ATGCCAAAAC 
TAGACGCTAA 



AGATGAATAT 
GTATCGTCTT 
TGTGCCTTTT 
TGAAGCAACT 
CGAAGCCAAT 
AAAAGACCAA 
TTTACATTTA 
TGTGGAAGGA 
TTTACATGAT 

cgaaggcgat 
tgatggcaca 
ttcaacgatt 
gattacggat 
caaagataca 
tcgtaatgca 
aagtgatgcc 
caatcctgcg' 



EF123-4 (SEQ ID NO:460) 
EEVNSD 

gqltlgevkq tsqqemtlal qgkaqpvtqe wvhysanvs ikaahwaapn ntrkiqvddq 
kkqiqielnq qaladtlvlt lnptatedvt fsygqqqral tlktgtdpte staitsspaa 
sanegsteea stnssvprss eetvasttka ieskttestt vkprvagptd isdyftgdet 
tiidnfedpi ylnpdgtpat ppykedvtih wnfnwsiped vreqmkagdy fefqlpgnlk 
pnkpgsgdlv daegnvygty tisedgtvrf tfneritses dihgdfsldt hlndsdgrgp 
gdwvidiptq edlppwipi vpdteqqidk qghfdrtpnp saitwtvdin qamkdqtnpt 
vtetwptgnt fksvkvyelv mnldgtikev grelspdeyt vdkngnvtik gdtnkayrle 
yqttideavi pdgggdvpfk nhatltsdnn pngldaeatv tatygkmldk rnidydeanq 
eftweinyny geqtipkdqa vitdtmgdnl tfepdslhly svtfddkgne wgaelvegk 
dykwingdg sfaidflhdv tgavkidykt kvdgivegdv avnnrvdvgt gqhseddgta 
sqqniikntg avdyqnstig wtlavnqnny lmenavitdt yepvpgltmv pnslwkdtt 
tgaqltlgkd fmveitrnad getgfkvsfi gayaktsdaf hityttffdv teldannpal 
dhyrntaaid w 



EF124-1 (SEQ ID NO:461) 

TAAAATAAAA AATTGGTACG AAGTGAACGT 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG 
TTGTTACTTG TTCTAAATTA TGGCACACCA 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG 
CTTCAAGGAA AAGCACAACC AGTAACACAA 
TCAATCAAAG CTGCACATTG GGCAGCGCCC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT 
ACGTTGAACC CTACAGCTAC AGAAGATGTG 
TTGACGTTAA AGACTGGTAC TGATCCGACA 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA 
ACAACGATTA TCGATAATTT TGAAGATCCG 
ACACCGCCGT ATAAAGAAGA TGTGACCATT 
GATGTGCGAG AACAAATGAA AGCAGGCGAT 



TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
GTAAACCGTT GGCTCTACGG GTTGATGTGT 
CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
GAGGTTGTAG TGCATTATAG TGCCAATGTG 
AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTTTCTT ATGGACAACA GCAACGAGCG 
GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
GCGATAGAAA GTAAAACAAC TGAATCGACG 
GATATCAGTG ATTATTTTAC AGGTGATGAA 
ATTTATTTAA ATCCTGATGG AACACCAGCA 
CATTGGAACT TTAACTGGTC GATTCCAGAA 
TACTTCGAGT TTCAATTACC TGGCAATTTG 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecaiis Genes. 

AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
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CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 



EF124-2 {SEQ ID NO:462) 



MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE WVHYSANVS IKAAHWAAPN NTRKIQVDDQ 
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA 
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
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PDKSVILEEG KDYTLEVTTD NETGQQKIW KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
QVSITGNGSE WHGDDNGDV WDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADY I IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WLKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 

EF124-3 (SEQ ID NO:463) 

TGCCTTCCACAT^CTTATACTACCTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG G 
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AF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW 
PDKSVILEEG KDYTLEVTTD NETGQQKIW 
QVSITGNGSE WHGDDNGDV WDIDHSGGH 
QAKTQVLREG TVDATGVITF GGLPQGQYIL 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN 
SAFTIAASDR GKPATVIATA NFVNYQGTAK 
TTNNQG 



PAFDLNAQKS GVYNAVTKE I TWTIAVNLSN 
KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
VETKAPEGYT VSDELAKGRV ITIDEETSAE 
FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
LIKKDVNGHL LSGATFKVLD AKGETIQTGL 



EF125-1 {SEQ ID NO:465) 

TAAAATAAAA AATTGGTACG AAGTGAACGT 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG 
TTGTTACTTG TTCTAAATTA TGGCACACCA 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG 
CTTCAAGGAA AAGCACAACC AGTAACACAA 
TCAATCAAAG CTGCACATTG GGCAGCGCCC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT 
ACGTTGAACC CTACAGCTAC AGAAGATGTG 
TTGACGTTAA AGACTGGTAC TGATCCGACA 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA 
ACAACGATTA TCGATAATTT TGAAGATCCG 
ACACCGCCGT ATAAAGAAGA TGTGACCATT 
GATGTGCGAG AACAAATGAA AGCAGGCGAT 
AAACCTAATA AACCAGGTTC AGGTGATTTA 
TACACAATTA GTGAAGATGG TACGGTTCGT 
AGTGACATTC ACGGGGACTT TTC TTTAGAT 
CCAGGAGATT GGGTGATTGA TATTCCTACA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC 
ACTGTGACGG AAACATGGCC AACAGGGAAT 
GTGATGAATC TTGATGGAAC AATTAAAGAA 
ACCGTTGATA AAAATGGCAA TGTGACGATT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT 
GTTACCGCCA CATATGGCAA AATGTTAGAC 
CAAGAATTCA CTTGGGAAAT TAACTACAAC 
GCAGTCATTA CAGACACAAT GGGGGATAAT 
TATTCAGTGA CATTTGATGA CAAAGGAAAT 
AAAGATTACA AAGTGGTAAT CAACGGAGAC 
GTGACTGGCG CAGTCAAGAT TGATTATAAA 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC 
GCCAGTCAAC AAAATATTAT TAAAAACACT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG 



TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
GTAAACCGTT GGCTCTACGG GTTGATGTGT 
CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
GAGGTTGTAG TGCATTATAG TGCCAATGTG 
AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTTTCTT ATGGACAACA GCAACGAGCG 
GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
GCGATAGAAA GTAAAACAAC TGAATCGACG 
GATATCAGTG ATTATTTTAC AGGTGATGAA 
ATTTATTTAA ATCCTGATGG AACACCAGCA 
CATTGGAACT TTAACTGGTC GATTCCAGAA 
TACTTCGAGT TTCAATTACC TGGCAATTTG 
GTTGATGCAG AAGGCAATGT CTATGGAACC 
TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
AAACAAGGCC ATTTTGATCG AACGCCCAAT 
AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGGGTCGCG AACTTAGTCC AGATGAATAT 
AAAGGTGACA CCAACAAAGC GTATCGTCTT 
ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AATCCAAATG GGTTAGATGC TGAAGCAACT 
AAGCGCAATA TAGATTACGA CGAAGCCAAT 
TATGGTGAAC AAACCATTCC AAAAGACCAA 
TTAACGTTTG AACCAGATTC TTTACATTTA 
GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
GGTTCCTTTG CAATTGACTT TTTACATGAT 
ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GGTGCAGTTG ATTATCAAAA TTCAACGATT 
TATTTGATGG AAAATGCCGT GATTACGGAT 
GTACCCAATT CGTTGGTTGT CAAAGATACA 
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ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA -GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGC AC CGAC A 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
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ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 



EF125-2 (SEQ ID NO:466) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VWHYSANVS IKAAHWAAPN NTRKIQVDDQ 
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVWNRVDVGT GQHSEDDGTA 
SQQNI IKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
PDKSVILEEG KDYTLEVTTD NETGQQKIW KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
QVS I TGNGSE WHGDDNGDV WDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WLKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 
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TAACTTTG TTAACTATCA AGGCACGGCT A) 
TTATTAAGTG GTGCGACATT TAAAGTGCTT 
TTGACGACAA ATAATCAAGG GGAAATTGTT 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG 
GTTTTTGAAT TATATGATCA CAATAAACAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG 
GCACCAAAAT TACCAGATGG CGCAGATTAT 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA 
GATGCAACGG ATGGCTATAT CGTCAATAAA 
TCAAATGATA AACAACCACT AGATGAGTTA 
GGACGTAAAG TCAACGAGCA AGGTCAAACC 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA 
AAAGTTTCTG AAATAACAAC GGATAAGACT 
GGGCATTACG TTTTAGTGGA AACGAAAGCA 
CATCCATTTG ATGTAACCGC CCAATTAGGA 
ATCAATTATC AAGGAACTGC TCAATTAACC 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT 
GAATTTAAAG TGACAGATGC AGAGACAGGG 
AACCAAGGGT TAGTTCAAGT GAACCACTTA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG 
GCGAAAGACA AACCTGAACT CGTGAATGCG 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA 
CTTGGT 



lATTAATCA AAAAAGATGT GAATGGACAC 
GATGCGAAGG GAGAAACGAT TCAAACAGGC 
GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTATTAAATA CCACGCCAGT CCCATTTGAA 
GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
AATAGCGCAG ACCAACCATT AGCAGGTGCT 
TCATTAGGGA TTACAGCAAC GAGTGGC AAA 
CCAGGTACCT ATTATTACAA AGAAATCAAA 
ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
GCCAATGCGA ACCCACTTCC AGGAACGATT 
AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
GGTGCTGGTA GCTATGAATT AGATGAACTG 
CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
CCGATAACAT TCTTGAATCG TGCAGGAGAA 
GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
CCAACAGGCT ATCTGTTAGA CACAACGCTA 
AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGGCAAACCG TAGATGGACA AACCAATCTG 
AAAAACTTAG CACCGGGAAC GTATCGTTTT 
CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
CAACCAGGAA AATATACCTT TGTGGAAACA 
CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GGCACGTTTG TTAACGAGAA ACAACCTGTA 
ACGAAACAAG CAGCTAGAGA GACAGGTTGG 



EF125-4 {SEQ ID NO:468) 



NFVNYQGTAK LIKKDVNGHL 
TTNNQGEIVA EHLAPGKYRF 
KGAFQIVKTN SADQPLAGAV 
PKLPDGADYI IYPELVKVEI 
KLYRIENGEK IFEREVTAEK 
NDKQPLDELE FVNYQAEVMG 
VSEITTDKTG EIYAKGLNEG 
NYQGTAQLTK ENETGEALAG 
ETQAPTSYLL NETPSASFTI 
FKVTDAETGQ . TVARSLRSDN 
KDKPELVNAG TFVNEKQPVS 



LSGATFKVLD AKGETIQTGL 
VETKAPTGYL LNTTPVPFEI 
FELYDHNKQS LGITATSGKD 
RGDFKGDPEI FQLGAFANFK 
DGSLAMEDLG AGSYELDELD 
RKVNEQGQTL AGAVFAIYNA 
HYVLVETKAP TGYLLDTTLH 
AVFKVIDETG QTVDGQTNLM 
AKDNQGKPAT WLKAPFINY 
QGLVQVNHLQ PGKYTFVETK 
KKTKPNQPTT KQAARETGWLt 



AEKNAGKPAV WASDNFVSY 
GKI IFRDLAP GTYYYKEIKA 
GRAVFKKIDA NANPLPGTIF 
ATDGYIVNKQ PIYFWKKNS 
DEQNQPQGSP ITFLNRAGEK 
PFDVTAQLGK EQPIALGDLI 
SDKQGKVIAK NLAPGTYRFV 
QGAAKLVKID QQKNALAGAE 
APDGYQLSKQ AVAFTIAATA 



EF126-1 (SEQ ID NO:469) 



TAGCGAAAGA AAATAGGGAG 
ATGGTGATTG TCGCTGGAAC 
GAAGCGGTTA AAGCAGGAGA 
AGTCTGGCTG ATTGTAAACG 
GAAACGGAAC CAGTCGATTT 



GATTAAAATG TTTAAGAAAG 
AGTTGTGGGA AA1TTCAGTC 
TACAGAAGGA ATGACCAATA 
GATATTGGAA GGACAAGCTA 
AGTAGTTGTT GAAGATGCTA 



CAACGAAATT ATTATCGACA 
CCACATTGGC TTTAGCTGAA 
CGGTGAAAGT GAAAGACGAC 
CTTTCCCAGT TCAAGCGGGT 
GTGGTAGTTT TTCAGATAAT 
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TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT. AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGAC TTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACC ATTAC T " ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF126-2 (SEQ ID NO:470) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF FHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
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IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF126-3 (SEQ ID NO:471) 
TGAA 

GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC GAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGC TTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGAT 

EF126-4 (SEQ ID NO:472) 

EE AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAID 



EF127-1 {SEQ ID NO:473) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA 
AGTCTGGCTG ATTGTAAACG GATATTGGAA 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT 
TTTCCACATG TAAGACAAGC GATTGATGAA 
GTGATGCTGG CTTCATATCG CGGCGGAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG 



TTTAAGAAAG CAACGAAATT ATTATCGACA 
AATTTCAGTC CCACATTGGC TTTAGCTGAA 
ATGACCAATA CGGTGAAAGT GAAAGACGAC 
GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
CAATTTATGT TTCCTGATGG AAAGACAAAA 
CGCGTCAATA CGCAATTGAC TTATGATAAA 
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AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGC C AGAAG A ■ TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF127-2 (SEQ ID NO: 474) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
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DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 

ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 

EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 

ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 

TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 

PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF127-3 (SEQ ID NO: 475) 

GAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 

ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAAT 



EF127-4 (SEQ ID NO:476) 
NQG TIAKEFPEAT 

IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDIN 

EF128-1 (SEQ ID NO:477) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
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TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EP128-2 (SEQ ID NO:478) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF128-3 (SEQ ID NO:479) 
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AGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 

CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TAGATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCAT 

EF128-4 (SEQ ID NO:480) - 

DENGK DVTANGKVTQ 

ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 



EF129-1 (SEQ ID NO:481) 

TGACAAGTGA AGAAACGTCT ATTTGCATCA 
ATTGCTACCC CAAGCATCGC TTTGGCGGAC 
CAAGAAATTT CATCATTAAA AGCAAAACAA 
GAAGCAGAAG TATCTTCAGT ATTTGATGAA 
CTAAAAGCAA AATCAGAACA ATTACAACAA 
AAACGTAACG AAGCAATCAA AAATCAAGCA 
ACAATGCTAG ATGCAGTTTT AGATGCGGAC 
GCTGTTTCAA CAATCGTAAG TGCCAACAAC 
CAAGCCGTTG TTGATAAAAA AGCTGAAAAC 
GAAGCTGAAT TAGAAACAAA ACGTCAAGAT 
ATGAAAGCTT CATTAGCATT AGAACAATCA 
AAACAAAAAG CAGCTGCTGA AGCAGAGCAA 
GCTGAAAAAG CCAAACAAGC TGCTGCAAAA 
CCAGTTGCCT CTTCATCAAC AACAGAAGCA 
GAATCAAGCA CGCAACAAAG AACTGAAACA 
GAAAATACTG GCTCTTCTTC ATCAGAACAA 
GGAAATAATG GTGGCCAAAC TGGTGGTGGA 
GCGCCTTCTG CTGATCCAAC AATCAATGCA 
CGTCCAGTAG TATGGGATGC AGGTTTGGCA 
GAAGCAGGTG GCATTCCAAA TGATCACTGG 
TGGGCGCCAG GTAACTCAGT AATCATGGCG 
TCAGGAAGCG GTCACCGTGA TTGGGAAATT 
TACTCAGGTA GCACAATCGT AGGACACTCA 



GTATTACTAT GTTCATTAAC GCTATCAGCA 
AATGTTGATA AAAAAATTGA AGAAAAAAAT 
GGGGATTTAG CTTCACAAGT ATCTTCTTTA 
AGCATGGCTT TACGTGAACA AAAGCAAACA 
GAAATTACAA ACTTGAATCA ACGTATTGAA 
CGTGATGTTC AAGTTAATGG ACAAAGCACA 
TCAGTTGCAG ATGCAATCAG CCGTGTTCAA 
GACTTAATGC AACAACAAAA AGAAGACAAA 
GAGAAAAAAG TGAAACAACT TGAAGCAACA 
TTACTTTCTA AACAATCTGA ATTAAACGTA 
TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA 
GCACGCTTAG CTGCTGAACA AAAAGCTGCA 
CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA 
CAAGCACCAG CAAGCTCAAG CTCAGCAACT 
ACTACACCAA GTACAGATAA TAGTGCAACA 
CCAGTACAAC CTACAACACC AAGCGATAAT 
ACAGTTACAC CAACACCAGA ACCAACACCA 
TTGAACGTTC TACGTCAATC ATTAGGTTTA 
GCTTCTGCAA CTGCTCGTGC AGCACAAGTT 
TCTCGTGGAG ATGAAGTTAT CGCAATTATG 
TGGTACAATG AAACAAACAT GGTAACAGCT 
AACCCAGGTA TTACGCGTGT CGGTTTTGGT 
GCCTAA 



EF129-2 (SEQ ID NO: 482) 
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VKKRLFASV LLCSLTLSAI ATPSIALADN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE 
AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT 
MLDAVLDADS VADAISRVQA VSTIVSANND LMQQQKEDKQ AWDKKAENE KKVKQLEATE 
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA 
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE 
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR 
PWWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS 
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA 

EF129-3 (SEQ ID NO:483) 

GGAC AATGTTGATA AAAAAATTGA AGAAAAAAAT 

CAAGAAATTT CATCATTAAA AGCAAAACAA GGGGATTTAG CTTCACAAGT ATCTTCTTTA 
GAAGCAGAAG TATCTTCAGT ATTTGATGAA AGCATGGCTT TACGTGAACA AAAGCAAACA 
CTAAAAGCAA AATCAGAACA ATTACAACAA GAAATTACAA ACTTGAATCA ACGTATTGAA 
AAACGTAACG AAGCAATCAA AAATCAAGCA CGTGATGTTC AAGTTAATGG ACAAAGCACA 
ACAATGCTAG ATGCAGTTTT AGATGCGGAC TCAGTTGCAG ATGCAATCAG CCGTGTTCAA 
GCTGTTTCAA CAATCGTAAG TGCCAACAAC GACTTAATGC AACAACAAAA AGAAGACAAA 
CAAGCCGTTG TTGATAAAAA AGCTGAAAAC GAGAAAAAAG TGAAACAACT TGAAGCAACA 
GAAGCTGAAT TAGAAACAAA ACGTCAAGAT TTACTTTCTA AACAATCTGA ATTAAACGTA 
ATGAAAGCTT CATTAGCATT AGAACAATCA TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA 
AAACAAAAAG CAGCTGCTGA AGCAGAGCAA GCACGCTTAG CTGCTGAACA AAAAGCTGCA 
GCTGAAAAAG CCAAACAAGC TGCTGCAAAA CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA 
CCAGTTGCCT CTTCATCAAC AACAGAAGCA CAAGCACCAG CAAGCTCAAG CTCAGCAACT 
GAATCAAGCA CGCAACAAAC AACTGAAACA ACTACACCAA GTACAGATAA TAGTGCAACA 
GAAAATACTG GCTCTTCTTC ATCAGAACAA CCAGTACAAC CTACAACACC AAGCGATAAT 
GGAAATAATG GTGGCCAAAC TGGTGGTGGA ACAGTTACAC CAACACCAGA ACCAACACCA 
GCGCCTTCTG CTGATCCAAC AATCAATGCA TTGAACGTTC TACGTCAATC ATTAGGTTTA 
CGTCCAGTAG TATGGGATGC AGGTTTGGCA GCTTCTGCAA CTGCTCGTGC AGCACAAGTT 
GAAGCAGGTG GCATTCCAAA TGATCACTGG TCTCGTGGAG ATGAAGTTAT CGCAATTATG 
TGGGCGCCAG GTAACTCAGT AATCATGGCG TGGTACAATG AAACAAACAT GGTAACAGCT 
TCAGGAAGCG GTCACCGTGA TTGGGAAATT AACCCAGGTA TTACGCGTGT CGGTTTTGGT 
TACTCAGGTA GCACAATCGT AGGACACTCA GCC 

EF129-4 (SEQ ID NO:484) 

DN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE , 

AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT 
MLDAVLDADS VADAISRVQA VSTIVSANND LMQQQKEDKQ AWDKKAENE KKVKQLEATE 
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA 
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE 
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR 
PWWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS 
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA 

EF130-1 (SEQ ID NO:485) 

TGATACATTA AAAGGAGGGA AAATATGCGC CCAAAAGAGA AAAAAAGAGG AAAAAATTGG 
TTAATCAACA GTTTATTAGT TTTACTATTT ATCATTGGCT TAGCCTTAAT TTTTAACAAT 
CAGATACGTA GTTGGGTGGT TCAACAAAAT AGCCGCTCGT ACGCCGTTAG CAAGTTGAAA 
CCAGCTGATG TGAAGAAAAA TATGGCTCGT GAAACAACGT TTGACTTTGA TTCAGTTGAG 
TCCTTGAGCA CAGAAGCGGT GATGAAAGCC CAATTTGAAA ACAAAAACTT ACCTGTGATT 
GGTGCCATTG CGATACCAAG TGTCGAAATT AATTTGCCCA TTTTTAAAGG ATTGTCCAAT 
GTCGCTTTAT TAACTGGTGC CGGGACCATG AAAGAAGATC AAGTCATGGG GAAAAACAAT 
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TATGCCTTGG CTAGTCATCG AACGGAAGAT GGCGTTTCCT TATTTTCACC TTTAGAAAGA 
ACCAAAAAAG ACGAACTCAT TTATATCACT GATTTATCTA CTGTTTATAC ATACAAAATA 
ACTTCTGTAG AAAAAATCGA ACCAACCCGT GTTGAGTTAA TTGATGACGT TCCTGGTCAA 
AATATGATTA CCTTAATTAC CTGTGGCGAT TTACAAGCAA CGACGCGAAT TGCTGTTCAA 
GGAACATTAG CAGCAACGAC GCCTATTAAA GACGCCAACG ACGATATGTT GAAGGCTTTC 
CAATTGGAGC AAAAAACTTT AGCCGATTGG GTGGCTTAA 



EF130-2 (SEQ ID NO:486) 

YIKRRENMRP KEKKRGKNWL INSLLVLLFI 
ADVKKNMARE TTFDFDSVES LSTEAVMKAQ 
ALLTGAGTMK EDQVMGKNNY ALASHRTEDG 
SVEKIEPTRV ELIDDVPGQN MITLITCGDL 
LEQKTLADWV A 



IGLALIFNNQ IRSWWQQNS RSYAVSKLKP 
FENKNLPVIG AIAIPSVEIN LPIFKGLSNV 
VSLFSPLERT KKDELIYITD LSTVYTYKIT 
QATTRIAVQG TLAATTPIKD ANDDMLKAFQ 



EF130-3 (SEQ ID NO:487) 



CGTTAG CAAGTTGAAA 
CCAGCTGATG TGAAGAAAAA 
TCCTTGAGCA CAGAAGCGGT 
GGTGCCATTG CGATACCAAG 
GTCGCTTTAT TAACTGGTGC 
TATGCCTTGG CTAGTCATCG 
ACCAAAAAAG ACGAACTCAT 
ACTTCTGTAG AAAAAATCGA 
AATATGATTA CCTTAATTAC 
GGAACATTAG CAGCAACGAC 
CAATTGGAGC AAAAAACTTT 



TATGGCTCGT GAAACAACGT 
GATGAAAGCC CAATTTGAAA 
TGTCGAAATT AATTTGCCCA 
CGGGACCATG AAAGAAGATC 
AACGGAAGAT GGCGTTTCCT 
TTATATCACT GATTTATCTA 
ACCAACCCGT GTTGAGTTAA 
CTGTGGCGAT TTACAAGCAA 
GCCTATTAAA GACGCCAACG 
AGCCGATTGG GTGGCT 



TTGACTTTGA TTCAGTTGAG 
ACAAAAACTT ACCTGTGATT 
TTTTTAAAGG ATTGTCCAAT 
AAGTCATGGG GAAAAACAAT 
TATTTTCACC TTTAGAAAGA 
CTGTTTATAC ATACAAAATA 
TTGATGACGT TCCTGGTCAA 
CGACGCGAAT TGCTGTTCAA 
ACGATATGTT GAAGGCTTTC 



EF130-4 (SEQ ID NO:488) 



VSKLKP 

ADVKKNMARE TTFDFDSVES LSTEAVMKAQ 
ALLTGAGTMK EDQVMGKNNY ALASHRTEDG 
SVEKIEPTRV ELIDDVPGQN MITLITCGDL 
LEQKTLADWV A 



FENKNLPVIG AIAIPSVEIN LPIFKGLSNV 
VSLFSPLERT KKDELIYITD LSTVYTYKIT 
QATTRIAVQG TLAATTPIKD ANDDMLKAFQ 



EF131-1 (SEQ ID NO:489) 



TAGGCGGAGG TAAGCGGTAT 
TGGCTTTTTA TAGTATGTTT 
TTCTTTTTCA CTAGAGATTC 
CGCCGAAGTG ATAATTATGC 
CTTGATCAAA AAATTCAAGA 
CAGGTTTTAG TAAATAAAGG 
CCAAACACAA GGTTTCAGAT 
AAAGCAATTG AAGAAGGTAA 
ATTCAAGGTG CTGAGGATAT 
TTATCAGCAA TGCCTAATAA 
AATACCATTC AAGTCAATAA 
GCAGGAATGT TAGAGAAAAT 
CACAAAACGG CTGGTTTAAA 
AATTCAACAA GTTATAAATG 



GCGTAAACGA CATGCAAAGA 
GTTGGTGGTG ATTGGTGGTA 
ACAAGTTAGT CAAGAATCGA 
GAATTTAACG AAAGAAATAG 
AACAAATTAT ATTGGTTCGG 
ATATGGCTTT GCCAATTTTG 
TGGCTCAATT CAAAAATCTT 
ACTTACATTA GATACAAAAC 
TACGATTAGC GATATGTTGA 
TATCGTTACC GATGAAGAAA 
AGGAAAATAC AATTATTCCC 
GTATCAACGT ACCTATCAAG 
GAATTTTGGC TTCTATGAAA 
GACAGAAGAT AATTCATATA 



AAAGACATGG AGGAGTGAAT 
GTGGTTATTT AATAAAAACG 
AAGTGGTCTT GGAAGAAGAT 
TTGCACCAGA TAGTGGCGAA 
CTTTGATCAT TAAAGATGAT 
AAAAGCAACA AGCCAACACG 
TTACCACAAC CTTGATCTTA 
TCGCTACGTT TTATCCGCAA 
ATATGACAAG TGGTTTAAAG 
TTATTCAATT TGTTAAACAA 
CAGTAAATTT TGTCCTTTTA 
AATTATTTAA TAATCTTTAT 
CCTTATTGGA ACAGCCCAAT 
ACCAAGTGCT CTCAATTCCT 
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GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG 
TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 
TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 
TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 
AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 
GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TTTAA 

EF131-2 (SEQ ID NO:490) 

MRKRH AKKRHGGVNW LFIVCLLWI GGSGYLIKTF FFTRDSQVSQ ESKWLEEDR 
RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP 
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD MLNMTSGLKL 
• SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT YQELFNNLYH 
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG NVDMTTGDLY 
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK 
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF 

EF131-3 (SEQ ID NO:491) 

TTT AATAAAAACG 

TTCTTTTTCA cTAGAGATTC ACAAGTTAGT CAAGAATCGA AAGTGGTCTT GGAAGAAGAT 
CGCCGAAGTG ATAATTATGC GAATTTAACG AAAGAAATAG TTGCACCAGA TAGTGGCGAA 
CTTGATCAAA AAATTCAAGA AACAAATTAT ATTGGTTCGG CTTTGATCAT TAAAGATGAT 
CAGGTTTTAG TAAATAAAGG ATATGGCTTT GCCAATTTTG AAAAGCAACA AGCCAACACG 
CCAAACACAA GGTTTCAGAT TGGCTCAATT CAAAAATCTT TTACCACAAC CTTGATCTTA 
AAAGCAATTG AAGAAGGTAA AC TT AC ATT A GATACAAAAC TCGCTACGTT TTATCCGCAA 
ATTCAAGGTG CTGAGGATAT TACGATTAGC GATATGTTGA ATATGACAAG TGGTTTAAAG 
TTATCAGCAA TGC CTAATAA TATCGTTACC GATGAAGAAA TTATTCAATT TGTTAAACAA 
AATACCATTC AAGTCAATAA AGGAAAATAC AATTATTCCC CAGTAAATTT TGTCCTTTTA 
GCAGGAATGT TAGAGAAAAT GTATCAACGT ACCTATCAAG AATTATTTAA TAATCTTTAT 
CACAAAACGG CTGGTTTAAA GAATTTTGGC TTCTATGAAA CCTTATTGGA ACAGCCCAAT 
AATTCAACAA GTTATAAATG GACAGAAGAT AATTCATATA ACCAAGTGCT CTCAATTCCT 
GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG 
TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 
TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 
TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 
AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 
GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TT 

EF131-4 (SEQ ID NO:492) 

LIKTF FFTRDSQVSQ ESKWLEEDR 

RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP 
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD MLNMTSGLKL 
SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT YQELFNNLYH 
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG NVDMTTGDLY 
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK 
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF 



EF132-1 (SEQ ID NO:493) 

TAGTTTTCTAATCTCACCAAAACAAAAATTTTTAAGAAAGAAGGAG 
GTGGGAAGTCTGGGAATGTTGATTGCTCTTTTTATATTCGGGGCATGT 



WO 98/50554 



PCT/US98/08959 



243 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

gcttcgaacgaaaaattaaaggtagtagttac 
attgatttacacagtatcgtacctattgggaaagatcc 
tcaaaagcagatttgattttttataacggtgttaacttggaira 
matgcgaacaaagaggaaaacaaagactattttgcagcaagtgatc 

gagaaagggaaggaagatccccatgcttggttaaatttagaaaacggtattatttacgctaaaaatattc 

ttagcggaaaaagatcctcataataaaaaattctat^^ 

gacaaagaagctaaatctaaatttgcttcaattccgaa 

tatttctcgaaagcgtataatgtgccttctgcttacatttgggaaa t caac ac tgaagaagaaggaac acc agatc aa 

ataaaacacttagttgaaaaattacgcacaacaaaagttcc^ 

atgaaaacagtatcaaaagataccaatattcctatctattc^ 

GATGGTGATAGTTACTATGCGATGATGAAATGGAACCTGGATAA 
EF132-2 (SEQ ID NO:494) 

^RKWKVWGSLGMLIALFIFGACSTNSKDKDTVASNEKLKV^^ 

LPEDVQKTSKADL IFYNGWLXTGGNAWFTKLVKXAI^ 1 1 

YAKNIEKWLAEKDPDNKKFYKENLDKYIEKLDSLDKEAKSKFAS I PNDKKMIVTSEGCFKYFSKAYNVPSAYIWE INT 
EEEGTPDQIKHLVEKLRTTKVPSLFVESSVDDRPMKWSKDTO^ 
GLSK. 

EF132-3 (SEQ ID NO:495) 

ATGTTCAACAAATAGTAAAGACAAAGATACAGTGGCTTCGAACGAAAAATTAAAGGTAGTAG 
AGCAGATATTACTGAAAATATAGCAAAAGATAAAATTGATTTACACAGTATCGTACCTATTGOT 
ATATGAACCtTTGCCTGAAGATGTTCAAAAAACTTC 
TGGAGGAAATGCTTGGTTTACAAAATTAGTAAAAmATGCG 

TGGCATAGATGTTATTTACTTAGAGGGTCAGAGTGAGAAAGGGAAGGAAGATCCCCATGCT^ 
CGGTATTATTTACGCTAAAAATATTGAAAAATG^ 

TCTAGATAAGTATATTGAAAAGTTGGATTCTCTAGACAAAGAAGCTAAATCTAAATTTC 
AAAAATGATTGTTACAAGTGAAGGATGCTTtAAATATT^ 

AAtCAACACTGAAGAAGAAGGAACACCAGATCAAATAAAACACTTAGTTGAAAAATTACGCACAACAAAAGTTCCCTC 
CTTATTCGTAGAAAGTAGTGTGGACGATAGACCGATGAAAACAGTATCAAAAGATACCAATATTCCTATCT^ 
GATTTTTACTGATTCAATTGCAGAAAAAGGACAAGATGGTGATAGTTACTATGCGATGATC 
AATTGCTGAAGGCCTTTCGAAA 



EF132-4 (SEQ ID NO:496) 

CSTNSKDKDTVASNEKLKVVVTNSILADITENIAKDKIDLHSIVPIGKDPHEYEPLPEDVQKTSKADLIFYNGVNLXT 
GGNAWFTKLVKXANKEENKDYFAASDG I DVI YLEGQSEKGKEDPHAWLNLENG 1 1 YAKNI EKWLAEKDPDNKKF YKEN 
LDKYIEKLDSLDKEAKSKFASIPNDKKMIVTSEGCFKYFS 

LFVESSVDDRPMKTVSKDTNI P I YSTIFTDS IAEKGQDGDSYYAMMKWNLDKIAEGLSK 
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Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBs 
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TABLE 3. Conservative Amino Acid Substitutions. 



PCT/US98/08959 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucine 




Valine 


Polar 


Glutamine 




Asnarapine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 
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Table 4. Residues Comprising Antigenic Epitope-Bearing Portion. 



EF001-2 


from about Asp-150 to about Lys-152, from about Ser-256 to about Tyr- 
z5y, rrom about Lys-360 to about Lys-363, from about Asn-406 to about 

/YSp-*fUo. 






brUUz-z 


rrom about Asp-oO to about Asp-83, from about Asp-281 to about Gly- 

ZOJ. 








irom about Asn-zoi to about Uly-zoo. 






brUU4-z 


irom about Asn-zi to about Asn-zo, rrom about Lys-83 to about Ser-87, 
from about Tyr-154 to about Asp-159. 






brUUD-z 


rrom about Lys-z49 to about ulu-252. 






brUuo-z 


rrom about uly-23 to about Asp-28. 






EF008-2 


from about Thr-92 to about Gly-94, from about Pro-161 to about Asp- 
165, from about Gly-287 to about Thr-289. 






EF010-2 


from about Pro- 129 to about Asn-131. 






EF012-2 


from about Asp-77 to about Asp-79, from about Asp-94 to about Lys-98, 
from about Asp-256 to about Thr-258, from about Glu-461 to about Asn- 
468. 






EF013-2 


from about Thr-30 to about Asp-32, from about Glu-73 to about Ala-75, 
from about Gin- 164 to about Asn- 166, from about Lys-193 to about Gly- 

1 C\C 

iy5. 






brul*4-z 


rrom about &er-2UJ to about Asp-206, from about (jln-314 to about Gly- 
316 






br015-2 


from about Pro-66 to about Gly-69. 






EF016-2 


from about Lys-236 to about Asn-239. 






brUl /-z 


irom about ber-90 to about uly-93, from about Tnr-197 to about Lys- 
199, from about Lys-230 to about Asn-233, from about Ser-428 to about 
Gly-431. 






EF018-2 


from about Lys-159 to about Tyr-161, from about Asn-165 to about Ser- 
167, from about Asn-250 to about Arg-256, from about Asn-392 to about 
Gly-395, from about Lys-416 to about Tyr-4 18, from about Asn-428 to 
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Table 4. Residues Comprising Antigenic Epitope-Bearing Portion. 





about Arg-4 JU. 






EF019-2 


from about Arg-209 to about Ser-21 1, from about Lys-287 to about Ser- 
290. 






EF020-2 


from about Lys-57 to about Asn-62. 






EF021-2 


from about Ser-33 to about Gly-35, from about Glu-77 to about Gly-81, 
from about Asp- 139 to about Lys-141, from about Glu-255 to about Ser- 
258, from about Gln-271 to about Tyr-277. 






EF023-2 


from about Lys-232 to about Asp-234, from about Arg-304 to about Gly- 
306, from about Thr-453 to about Arg-456, from about Ser-478 to about 
Thr-480, 






EF025-2 


from about Arg-183 to about Asp-185. 






EF026-2 


from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, 
from about Gin- 1 07 to about Asn-l 10. 






EF027-2 


from about Gln-72 to about Lys-74, from about Lys-229 to about Asp- 
23 1. 






EF028-2 


from about Asp-186 to about Gln-188. 






hr029-2 


from about Asp-1 18 to about Lys-122, from about Asp- 124 to about 
i yr- Jzo. 






JirUJ I-z 


trom about uiu-ju to about oiy-j.J. 






EF034-2 


from about Glu-25 to about Gly-27, from about Glu-75 to about Tnr-77. 










br3o-2 


trom about Gin- 177 to about ber-179. 






EF037-2 


from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, 

fWvm qUniii Clin 1 (Yl ir\ oVsm it A en 1 1fl 

irom aooui uin- 1 u / 10 aooui Asn- 1 1 u. 






EF038-2 


from about Asn-77 to about Lys-79, from about Tyr-88 to about Asn-92. 






EF040-2 


from about Lys-167 to about Gly- 172, from about Lys-240 to about 
Asn-242. 
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EF044-2 


from about Arg-192 to about Gly-194, from about Asn-200 to about Asn- 
203. 






EF045-2 


from about Asp-159 to about Asn-161, from about His-172 to about Gly- 
174, from about Tyr-261 to about Gly-264, from about Lys-305 to about 
Glu-308. 






EF046-2 


from about Ser-18 to about Gly-23, from about Gln-41 to about Ser-47, 
from about Thr-76 to about Asp-78. 






EF047-2 


from about Asn-28 to about Asp~30, from about Asp-273 to about Asn- 
277. 






EF048-2 


from about Asp-138 to about Lys-141, from about Asp-152 to about 
Gly-154. 






EF051-2 


from about Asp-73 to about Gly-76, 






EF053-2 


from about Ser-79 to about Gly-82. 






EF055-2 


from about Asp-26 to about Gly-28, from about Gln-67 to about Asp-69, 
from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89. 






EF056-2 


from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89. 










EF058-2 


from about Lys-129 to about Gly-133, from about Gln-571 to about Tyr- 
573, from about Pro-586 to about Gly-591 . 






EF065-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF066-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF067-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 
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Table 4. Residues Comprising Antigenic Epitope-Bearing Portion. 







EF073-2 


from about Met-98 to about Arg-1 00, from about Arg-1 10 to about Asp- 
112, 






EF074-2 


from about Ser-53 to about Tyr-59, from about Ser-86 to about Gly-88, 
from about Pro-97 to about Gln-100, from about Gln-230 to about Gly- 
232. 






EF076-2 


from about Asn-38 to about Tyr-40, from about Asp-48 to about Asn-53, 
from about Lys-79 to about Gly-8 1 . 






EF077-2 


from about Arg-41 1 to about Gly-413. 






EF078-2 


from about Thr-294 to about Gly-296, from about Asp-366 to about Gln- 
368, from about Glu-524 to about Gly-526. 






EF080-2 


from about Glu-164 to about Gly-1 66, from about Ser-206 to about Tyr- 
208, from about Lys-239 to about Gly-243. 






EF081-2 


from about Asn-7 to about Ser-1 1, from about Lys-77 to about Tyr-80, 
from about Lys-1 12 to about Asn-1 14, from about Gly-1 62 to about Asp- 
164, from about Arg-1 81 to about Gly-183. 






EF083-2 


from about Gln-38 to about Arg-40. 






EF084-2 


from about Lys-1 40 to about Asp- 142, from about Gly-164 to about Arg- 
166, from about Arg-262 to about Gly-264. 






EF085-2 


from about Asn-95 to about Asp-97, from about Arg-1 12 to about Asp- 
1 14, from about Asp-258 to about Ser-260, from about Arg-40 1 to about 
Ser-403. 






EF086-2 


from about Pro-1 12 to about Gly-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 






EF087-2 


from about Pro-1 12 to about Gly-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 






EF088-2 


from about Pro-1 12 to about Gly-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 
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EF090-2 


from about Arg-2 to about Arg-5. 






EF091-2 


from about Gln-40 to about Asp-43. 






EF093-2 


from about Lys-95 to about Gly-97. 






EF094-2 


from about Asp-314 to about Asp-316. 






EF095-2 


from about Ser-328 to about Thr-330, from about Asp-359 to about Asp- 
363, from about Glu-637 to about Gly-639, from about Asn-744 to about 
Gly-746. 






EF096-2 


from about Pro-128 to about Asn-130, from about Ser-193 to about Asp- 
196. 






EF097-2 


from about Val-357 to about Gly-359. 






EF099-2 


from about Glu-44 to about Asp-47, from about Lys-1 54 to about Gly- 
156, from about Asn-286 to about Asp-289. 






EF101-2 


from about Lys-40 to about Asp-42, from about Pro-255 to about Asn- 
258, from about Lys-288 to about Gly-290. 






EF102-2 


from about Asp-314 to about Asp-316. 






EF103-2 


from about Asn-46 to about Gly-48. 






EF 104-2 


from about Pro-232 to about Lys-237, from about Ala-362 to about Asn- 
366, from about Ser-421 to about Gly-423, from about Lys-488 to about 
Ser-490, from about Asp-550 to about Asn-552, from about Pro-637 to 
about Lys-640, from about Asp-727 to about Gly-729, from about Asn- 
751 to about Ser-754, from about Lys-771 to about Asn-774, from about 
Ile-835 to about Asn-837, from about Pro-851 to about Gly-853. 






EF105-2 


from about Ser-40 to about Gly-43, from about Asn-94 to about Gln-97, 
from about Gln-220 to about Gly-222, from about Asn-263 to about Gly- 
265. 






EF106-2 


from about Asp-72 to about Gly-75, from about Thr-274 to about Asp- 
277, from about Asn-310 to about Arg-3 13. 






EF107-2 


from about Thr-155 to about Asn-157, from about Thr-189 to about Asp- 
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191, from about Arg-270 to about Gly-272, from about Thr-330 to about 
Lys-335, from about Asp-365 to about Asp-368, from about Pro-451 to 
about Asp-453, from about Gly-485 to about Thr-488. 






EF108-2 


from about Lys-142 to about Trp-145, from about Thr-147 to about Tyr- 
150, from about Arg-2 12 to about Gly-214, from about Ser-248 to about 
Asp-251, from about Asp-384 to about Asp-387, from about Pro-481 to 
about Arg-483, from about Lys-491 to about Gly-494, from about Thr- 
619 to about Gly-624, from about Asp-656 to about Asp-659, from about 
Lys-717 to about Asn-721, from about Ser-822 to about Gly-824, from 
about Tyr-1 137 to about Thr-1 141 . 






EF110-2 


from about Pro- 123 to about Gly-127, from about Thr-223 to about Gly- 
225. 






EF111-2 


from about Lys-207 to about Asn-209, from about Asp-245 to about 
Asn-248, from about Lys-396 to about Asp-398, from about Glu-429 to 
about Ser-432, from about Thr-470 to about His-474. 






EF119-2 


from about Asp-90 to about Asn-92, from about Gin- 142 to about Gly- 
144. 






EF121-2 


from about Asn-159 to about Asp-161, from about Asn-351 to about 
Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to 
about Ser-789. 






EF122-2 


from about Asn-159 to about Asp-161, from about Asn-351 to about 
Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to 
about Ser-789. 






EF123-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Ser-782, from about Tyr-909 to about 
Asn-91 1, from about Lys-939 to about Glu-942, from about Asp- 1074 to 
about Gly-1076, from about Asp-1367 to about Gly-1369, from about 
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
from about Lys-1656 to about Asp-1660, from about Lys-1860 to about 
Gln-1863, from about Ser-1916 to about Gln-191 9, from about Pro-1940 
to about Gly- 1942. 






EF124-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Ser-782, from about Tyr-909 to about 
Asn-91 1, from about Lys-939 to about Glu-942, from about Asp-1074 to 
about Gly-1076, from about Asp-1367 to about Gly-1369, from about 
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
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from about Lys-1656 to about Asp-1660, from about Lys-1860 to about 
Gln-1863, from about Ser-1916 to about Gln-1919, from about Pro-1940 
to about Gly- 1942. 






EF125-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Ser-782, from about Tyr-909 to about 
Asn-91 1, from about Lys-939 to about Glu-942, from about Asp-1074 to 
about Gly-1076, from about Asp-1367 to about Gly-1369, from about 
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
from about Lys-1656 to about Asp- 1660, from about Lys-1860 to about 
Gln-1863, from about Ser-1916 to about Gln-1919, from about Pro-1940 
to about Gly-1 942. 






EF126-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF127-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF128-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF 129-2 


from about Asn-300 to about uly-302, rrorn about ber-ilo to about uiy- 
3 19, from about Asn-385 to about His-387 






EF131-2 


from about Lys-201 to about Tyr-204, from about Glu-263 to about Ser- 
266. 






EF 132-2 


from about Thr-26 to about Ser-28. 
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What Is Claimed Is: 

L An isolated nucleic acid molecule comprising a polynucleotide having a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding any one of the amino acid sequences of the 
polypeptides shown in Table 1; or 

(b) a nucleotide sequence complementary to any one of the nucleotide sequences in (a). 

(c) a nucleotide sequence at least 95% identical to any one of the nucleotide sequences 
shown in Table 1 ; or, 

(d) a nucleotide sequence at least 95% identical to a nucleotide sequence complementary 
to any one of the nucleotide sequences shown in Table 1. 

2. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which 
hybridizes under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence identical to a nucleotide sequence in (a) or (b) of claim 1 . 

3. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which 
encodes an epitope-bearing portion of a polypeptide in (a) of claim 1 . 

4. The isolated nucleic acid molecule of claim 3, wherein said epitope-bearing portion 
of a polypeptide comprises an amino acid sequence listed in Table 4. 

5. A method for making a recombinant vector comprising inserting an isolated nucleic 
acid molecule of claim 1 into a vector. 

6. A recombinant vector produced by the method of claim 5. 

7. A host cell comprising the vector of claim 6. 

8. A method of producing a polypeptide comprising: 

(a) growing the host cell of claim 7 such that the protein is expressed by the cell; and 

(b) recovering the expressed polypeptide. 

9. An isolated polypeptide comprising a polypeptide selected from the group 
consisting of: 

(a) a polypeptide consisting of one of the complete amino acid sequences of Table 1 ; 

(b) a polypeptide consisting of one the complete amino acid sequences of Table 1 except 
the N-terminal residue; 
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(c) a fragment of the polypeptide of (a) having biological activity; and 

(d) a fragment of the polypeptide of (a) which binds to an antibody specific for the 
polypeptide of (a). 

10. An isolated antibody specific for the polypeptide of claim 9. 

1 1. A polypeptide produced according to the method of claim 8. 

12. An isolated polypeptide comprising an amino acid sequence at least 95% identical to 
a sequence selected from the group consisting of an amino acid sequence of any one of 
the polypeptides in Table 1. 

13. An isolated polypeptide antigen comprising an amino acid sequence of an E. 
faecalis epitope shown in Table 4. 

14. An isolated nucleic acid molecule comprising a polynucleotide with a nucleotide 
sequence encoding a polypeptide of claim 9. 

15. A hybridoma which produces an antibody of claim 10. 

16. A vaccine, comprising: 

(1) one or more E. faecalis polypeptides selected from the group consisting of a 
polypeptide of claim 9; and 

(2) a pharmaceutical^ acceptable diluent, carrier, or excipient; 

wherein said polypeptide is present, in an amount effective to elicit protective 
antibodies in an animal to a member of the Enterococcus genus. 

17. A method of preventing or attenuating an infection caused by a member of the 
Enterococcus genus in an animal, comprising administering to said animal a polypeptide 
of claim 9, wherein said polypeptide is administered in an amount effective to prevent 
or attenuate said infection. 

18. A method of detecting Enterococcus nucleic acids in a biological sample 
comprising: 

(a) contacting the sample with one or more nucleic acids of claim 1 , under conditions 
such that hybridization occurs, and 

(b) detecting hybridization of said nucleic acids to the one or more Enterococcus 
nucleic acid sequences present in the biological sample. 
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19. A method of detecting Enterococcus nucleic acids in a biological sample obtained 
from an animal, comprising: 

(a) amplifying one or more Enterococcus nucleic acid sequences in said sample using 
polymerase chain reaction, and 

(b) detecting said amplified Enterococcus nucleic acid. 

20. A kit for detecting Enterococcus antibodies in a biological sample obtained from an 
animal, comprising 

(a) a polypeptide of claim 9 attached to a solid support; and 

(b) detecting means. 

21. A method of detecting Enterococcus antibodies in a biological sample obtained 
from an animal, comprising 

(a) contacting the sample with a polypeptide of claim 9; and 

(b) detecting antibody-antigen complexes. 
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